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We consider the TV-user broadcast erasure channel with N unicast sessions (one for each user) 
where receiver feedback is regularly sent to the transmitter in the form of ACK/NACK messages. We 
first provide a generic outer bound to the capacity of this system; we then propose a virtual-queue-based 
inter-session mixing coding algorithm, determine its rate region and show that it achieves capacity under 
O ■ certain conditions on channel statistics, assuming that instantaneous feedback is known to all users. 

Removing this assumption results in a rate region that asymptotically differs from the outer bound by 
I> ' 1 bit as L — > oo, where L is the number of bits per packet (packet length). For the case of arbitrary 

, channel statistics, we present a modification of the previous algorithm whose rate region is identical to 



the outer bound for N = 3, when instant feedback is known to all users, and differs from the bound by 



Q\ ■ 1 bit as L — > oo, when the 3 users know only their own ACK. The proposed algorithms do not require 

any prior knowledge of channel statistics. 
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I. Introduction 

Broadcast channels have been extensively studied by the information theory community since their 
introduction in [flX Although their capacity remains unknown in the general case, special cases have 
been solved, including the important category of "degraded" channels 0. Another class of channels that 
has received significant attention is erasure channels, where either the receiver receives the input symbol 
unaltered or the input symbol is erased (i.e. not received at all) at the receiver. The class of erasure 
channels is usually employed as a model for lossy packet networks. 

Combining the above classes, a broadcast packet erasure channel (BPEC) is a suitable abstraction 
for wireless communications modeling since it captures the essentially broadcast nature of the medium 
as well as the potential for packet loss (due to fading, packet collision etc). Since this channel is not 
necessarily degraded, the computation of its feedback capacity region is an open problem. Numerous 
variations of this channel, under different assumptions, have been studied, a brief summary of which 
follows. 

For multicast traffic, an outer bound to the capacity region of erasure channels is derived in Q, in 
the form of a suitably defined minimum cut, and it is proved that the bound can be achieved by linear 
coding at intermediate nodes. The broadcast nature is captured by requiring each node to transmit the 
same signal on all its outgoing links, while it is assumed that the destinations have complete knowledge 
of any erasures that occurred on all source-destination paths. In a sense, is the "wireless" counterpart 
to the classical network coding paradigm of 01, since it carries all results of (H (which were based on 
the assumption of error-free channels) into the wireless regime. 

The concept of combining packets for efficient transmission based on receiver feedback is also used in 
|[5l , where broadcast traffic is assumed and a rate-optimal, zero-delay, offline algorithm is presented for 
3 users. Online heuristics that attempt to minimize the decoding delay are also presented. Reference O 
expands on this work by presenting an online algorithm that solves at each slot a (NP-hard) set packing 
problem in order to decide which packets to combine. This algorithm also aims in minimizing delay. 

Multiple unicast flows, which are traditionally difficult to handle within the network coding paradigm, 
are studied in Q for a network where each source is connected to a relay as well as to all destinations, 
other than its own, and all connections are modeled as BPECs. A capacity outer bound is presented for 
an arbitrary number of users N and is shown to be achievable for N = 3 and almost achievable for 
N = 4, 5. The capacity-achieving algorithm operates in two stages with the relay having knowledge of 
the destination message side information at the end of the first stage but not afterward (i.e. once the 
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second stage starts, the relay does not receive feedback from the destinations). 

A similar setting is studied in (H, where ACK-based packet combining is proposed and emphasis is 
placed on the overhead and complexity requirements of the proposed scheme. An actual implementation 
of packet XORing in an intermediate layer between the IP and 802.11 MAC layers is presented and 
evaluated in [9], while ifTOl proposes a replacement for the 802.11 retransmission scheme based on 
exploiting knowledge of previously received packets. 

This paper expands upon earlier work in ifTTI . lfT2l . (which studied the case N = 2) and differs from 
the aforementioned works in that, although it also uses the idea of packet mixing (in the network coding 
sense), it introduces additional concepts and tools that generalize the results concerning achievable rates 
to more than 2 users and provides explicit performance guarantees. Specifically, an outer bound to the 
feedback capacity region for multiple unicast flows (one for each user) is computed and, assuming public 
feedback is available, two online algorithms (named CODEl ptt 6 and CODE2 pu b) are presented that achieve 
this bound under certain conditions on rates and channel statistics. If public feedback is not available, we 
propose modifications to these algorithms that achieve rates within 1 bit/transmission of the outer bound 
asymptotically in the size of packet length. 

The algorithms do not require any knowledge of channel parameters (such as erasure probabilities) 
or future events so that they can be applied to any BPEC. They use receiver feedback to combine 
packets intended for different users into a single packet which is then transmitted. The combining scheme 
(i.e. choosing which packets to combine and how) relies on a group of virtual queues, maintained in 
the transmitter, which are updated based on per-slot available receiver ACK/NACKs. This queue-based 
coding concept has also been used in fl3l . albeit for broadcast traffic with stochastic arrivals where the 
stability region of the proposed algorithm becomes asymptotically optimal as the erasure probability goes 
to 0, whereas we consider systems with an arbitrarily fixed number of packets per unicast session where 
the capacity is achieved for arbitrary values of erasure probability. 

During the preparation of this paper, we were informed that C. Wang has independently studied in |[T4l 
the same problem as appears here and proposed coding algorithms that achieve capacity under the same 
conditions as ours. Although the two works share common ideas (namely, employing degraded channels 
to derive capacity outer bounds and performing packet coding based on receiver feedback), the proposed 
algorithms, the procedures for handling overhead, as well as the methodology used for deriving their rate 
regions, are quite different. 

The paper is structured as follows. Section|II]describes the exact model under investigation and provides 
the necessary definitions in order to derive the capacity outer bound in Section [TlTJ The first coding 



August 29, 2012 



DRAFT 



4 



algorithm is presented in Section [IVJ along with a discussion of the intuition behind the algorithm and 
a detailed example. The main properties of the algorithm are also presented. The algorithm's optimal 
performance under certain conditions on channel statistics and publicly available feedback is established 
in Section El We also present a variant of the algorithm that does not require public feedback due to the 
incorporation of overhead and determine the corresponding reduction. A modification of the algorithm 
that achieves capacity for 3 users under arbitrary channel statistics is presented in Section ED while 
Section I VII I concludes the paper. Appendices lAWGl contain most of the technical proofs. 

II. System model and definitions 

The system model is a direct extension to N users of the corresponding model in ifTTTl but is nonetheless 
repeated for completeness. We study a time-slotted system where a packet of fixed length L bits is 
transmitted in each slot. Without loss of generality, we normalize to unity the actual time required to 
transmit a single bit so that the time interval [(I — l)L IL), for I = 1,2, ... , corresponds to slot I. The 
communication system consists of a single transmitter and a set Af = {1, 2, ... , N} of receivers/users (we 
hereafter use these two terms interchangeably), while the channel is modeled as memoryless broadcast 
erasure (BE), so that each broadcast packet is either received unaltered by a user or is "erased" (i.e. the 
user does not receive the packet). The latter case is equivalent to considering that the user receives the 
special symbol E, which is distinct from any other possible transmitted packet and does not actually 
map to a physical packet (since it models an erasure). We hereafter use the term "packet" to refer to 
any sequence of L bits and the term "symbol" to refer to a packet or an erasure E (we retain however 
the standard nomenclature of "input symbol" and "output symbol", although the former is a true packet 
while the latter can also be an erasure). 

In information-theoretic terms, the broadcast packet erasure channel is described by the tuple (X, (3^ : 
i G J\f),p(Yi\Xi)), where X is the input symbol alphabet (we hereafter assume X = ¥ q , with ¥ q a 
suitable field of size q), 3i = y = X U {E} is the output symbol alphabet (where E X) for user 
i, and p{Yi\X{) is the probability of having, at slot I, output Yi = (Yii,i G Af) for a broadcast input 
symbol X\. The memoryless property implies that p(Yi\X{) is independent of I, so that it is simply 
written as p(Y\X). Since the transmitted symbols are packets of L bits, we identify ¥ g with the set of 
L-bit sequences, so that it holds q = 2 L . 

Define Zn = I\Yn = E] as the indicator function of an erasure occurring for user i at slot I, and 
consider the random vector Z% = (Zij,i € Af). The sequence {Zi}'^ l is assumed to consist of temporally 
iid vectors (we denote with Z the random vector with distribution equal to that of Zf), although, for a 
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fixed slot, arbitrary correlation between erasures for different users is allowed. For any index set X C Af, 
we define Ex = {Z% = 1, Vi G X} = C\i^x{Zi = 1} as the event that an erasure occurs for all users 
in X. We also use the convention that an intersection over an empty index set yields the entire space to 
define E = Q (the sample space). We denote ex = Pr(Ex) (so that e = 1) and, for simplicity, write e% 
instead of e^y. In order to avoid trivially degenerate cases, we henceforth assume e% < 1 for all i G Af. 

Using the introduced notation, when the transmitter at the beginning of slot I broadcasts symbol Xi, 
each user i receives symbol Y^i = ZijE + (1 — Z^i)Xi. At the end of each slot /, all users inform the 
transmitter whether the symbol was received or not, which is equivalent to each user i sending the value 
of Zij (essentially, a simple ACK/NACK) through an error-free zero-delay control channel. 

A channel code, denoted as (Mi, . . . , Mjv, n), for the broadcast channel with feedback is now defined 
as the aggregate of the following components (this is an extension of the standard definition in lfl31 to 
users): 

• message sets Wi of size |Wj| = Mj for each user i G Af, where |-| denotes set cardinality. Denote the 
message that needs to be communicated as W = (Wi,i G Af) G W, where W = Wi x . . . x Wat. It 
will also be helpful to interpret the message set Wj as follows: assume that user i needs to decode 
a given set /Q of L-bit packets. Then, Wj is the set of all possible \K-i\L bit sequences, so that it 
holds \Wi\ = Mi = 2^ L . 

• an encoder that transmits, at slot /, a symbol Xi = fi{W,Y l ~ l ) belonging to ¥ q , based on the 
value of W and all previously gathered feedback Y l ~ l = (Yi, . . . , l^_i). X\ is a function of W 
only. A total of n symbols are transmitted for message W. 

• N decoders, one for each user i G Af, represented by the decoding functions : y n — > Wj, 
so that the reconstructed symbol is Wi = g(Y™), where YJ 1 = (Y^i, . . . ,Y^ n ) is the sequence 
of symbols received by user i (including any erasure symbols E) during the n slots. Thus, the 
decoding performed by user i depends only on packets received by i, i.e. each user knows only its 
own feedback. 

Hence, a code C is fully specified by the tuple (Mi, . . . , M/v, n, (//:/ = 1, ... , n), (gi : i G Af)), which 
contains the message set size along with the encoding/decoding functions; for brevity, we will simply 
write (Mi, . . . , M^,n) to denote C. The probability of erroneous decoding for message W is X n (W) = 
PT(Ui < zj^{gi(Y i n ) ^ Wi}\W). The rate R for this code, measured in information bits per transmitted 
symbol, is now defined as the vector R = (Ri : i G Af) with Ri = (log 2 M;)/n. Hence, it holds 
Ri = \Ki\L/n. 

Let <£ be a class of (Mi, . . . , M^,n) codes. Then, a vector rate R = {R\, . . . , Rn) is achievable under 
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£ if there exists a sequence of codes (\2 nRl ] , \2 nRN ] , n) in £ such that ^ ^n(W) 

as n — > oo. Equivalently, we say that £ achieves rate R. The closure of the set of rates R that are 
achievable under £ constitutes the rate region of £. We further define a rate R to be achievable if there 
exists some class £ of codes that achieves R. Finally, the capacity region of a channel is defined as 
the closure of the set of all achievable rates, i.e. the closure of the union of rate regions of all possible 
classes of codes £ for this channel. 

The following definition, introduced in Q, will be useful in deriving the outer bound for the capacity 
of the broadcast erasure channel. 

Definition 1: A broadcast, not necessarily erasure, channel [X, (3^ : i S M),p(Y\X)) with receiver 
set N is physically degraded if there exists a permutation tt on M such that the sequence X — > ^(i) — > 
. . . — > i^jYjv) forms a Markov chain. 

A generalization to N users of the 2-user proof in lfl6l provides the following result. 

Lemma 1: Feedback does not increase the capacity region of a physically degraded broadcast channel. 
We now have all necessary tools to compute a capacity outer bound. 

III. Capacity outer bound 

Our derivation of the capacity outer bound is based on a method similar to the approaches in lfl4l . ifTTl - 
|[T9l . We initially state a general result on the capacity of broadcast erasure channels without feedback 

ma. 

Lemma 2: The capacity region (measured in information bits per transmitted symbol) of a broadcast 
erasure channel with receiver set M and no feedback is 

CnoFB = I R > : V < L I , (1) 

which implies that capacity can be achieved by a simple timesharing scheme. 

We denote with C the channel under consideration and, for an arbitrary permutation 7r on J\f, introduce 
a new, hypothetical, broadcast channel C n with the same input/output alphabets as C and an erasure 
indicator function of Z^^j = YYj=i m other words, a symbol at slot I is erased by user n(i) in 

CV if and only it is erased by all users ir(j) in channel C, with j < i, at slot I. This occurs with probability 
= e u*_ 1 {7r(j)}- A straightforward calculation reveals that it holds X — > I^-(jv) ^-(i)- Hence, 

choosing the permutation tt in Definition Q] such that 7r(i) = ir(N — i + 1), we deduce that channel CV 
is physically degraded. 
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In fact, channel C w can be viewed as an augmentation of the original channel C, where additional 
error-free virtual channels are introduced between the receivers. Specifically, each user tv({) in C n , for 
1 < i < 7Y — 1, sends its output symbol to user n(i + 1) through an error-free channel. Hence, any 
achievable rate for channel C can also be achieved for C w using the same code as in C and ignoring 
any symbols transmitted through the virtual channels. Denoting with Cfb, Ctt,fb the feedback capacity 
regions of channels C, C n , respectively, we conclude that it holds Cfb Q C-k,fb- 

The above set inclusion already provides an outer bound to Cfb- In order to derive this bound, we 
note that the previous results imply that the feedback capacity region of the physically degraded channel 
CV is identical, due to Lemma [T] to the capacity region of C n without feedback. The latter is described, 
in general form, in Lemma [2] whence the following result follows. 

Lemma 3: The feedback capacity region of is given by 

= \ R > : V : <L\. (2) 

The above analysis was based on a particular permutation it. Considering all iV! permutations on M 
provides a tighter general outer bound. 

Lemma 4: It holds Cfb Q C out = r\ ne -pCir.FB, where V is the set of all possible permutations on Af. 

The outer bound C out has been derived based on the decoding rule in Section HQ i.e. each user in 
channel C knows only its own feedback at each slot (hereafter referred to as "private" feedback). This 
raises a question regarding whether this bound is also valid for publicly available feedback (i.e. when 
each user in C knows the feedback from all other users at each slot). This question can be answered in 
the affirmative by extending the bounding arguments in the recent work of ||2T1 . which considered the 
case iV = 3 and public feedback (which corresponds to a decoding function of the form giiY^, Z 11 )), 
to general N. Since the use of public feedback simplifies the presentation of the proposed algorithms, 
we initially assume that public feedback is available and propose a coding algorithm named CODEl pu £,. 
We remove this assumption later in Section IV-AI by proposing a simple overhead scheme on top of the 
former algorithm, which leads to a new algorithm, named CODEl pr j, that only requires private feedback. 

IV. A CLASS OF CODES 

In this Section, we present a class of codes, collectively referred to as algorithm CODEl pu b (the 
index emphasizes the assumption of public feedback), and describe the basic properties that guarantee 
its correctness. 
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A. The intuition behind the algorithm 

Before the algorithm's description, a brief discussion of its underlying rationale will be useful. Since 
each user i must decode exactly the |/Q| packets in its session and a packet is an L-bit representation of 
an element in ¥ q , the transmitter transmits appropriate linear combinations of packets so that each user 
i eventually receives |/Q| linearly independent combinations of the packets in /Q. Hence, all quantities 
appearing in subsequent expressions are elements of ¥ q and all linear operations are performed in ¥ q . 

The algorithm's operation can be summarized as follows: the transmitter maintains a set of virtual 
queues Qs, indexed by all non empty subsets and properly initialized, as well as queues Qr>i 

for i G M. The queues Qd i contain copies of the packets that have been successfully received by user 
i G J\f. The algorithm processes each queue Qs sequentially; during the processing of each queue Qs, 
the packet s to be transmitted next is selected as a linear combination of all packets currently stored in 
Qs, i.e. s = XlpeQs a s{p)P, where a s (p) are suitably chosen coefficients in ¥ q . Notice that, unless a s (p) 
is non-zero for exactly one p G Qs, the transmitted packet s is not actually stored in Qs but is created 
on-the-fly. 

After transmitting s, the transmitter gets the ACK/NACKs for s from the receivers and (depending on 
which users successfully received s) potentially adds packet s into a single queue Qs>, with S' D S, 
and/or to queues Qoi, for all users i that received the packet. Some additional bookkeeping, to be 
described in detail in Section IIV-BI is also performed. The algorithm terminates when all queues Qs 
have been processed, at which point each user i can decode its original packets based on the packets 
contained in Qot- 

A central concept in the proposed algorithm is the notion of "token" which is defined as follows. 
Definition 2: A packet s is a token for user i iff s can be written in the form 

* = E 6 - ) (p)p+ c - ) > < 3) 

where # = (b { J\p), p € Id), cf G ¥ q are known to user i. We call the "coefficient vector" of 
packet s for user i. 

In words, a token for user i is any packet s that allows i, upon reception of s, to effectively construct a 

(i) (i) 

linear equation with the packets in /Q as unknowns (since by, c s are known). For efficiency reasons, 
this equation should ideally be linearly independent w.r.t. all equations constructed by user i through the 
previously received packets (equivalently, bs span({b^) : s' received by i prior to s})). In this case, 
borrowing from network coding terminology, the packet is considered to be an "innovative" token. 
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Hence, each user i must receive |/Q| innovative tokens in order to decode its packets, at which point 
the algorithm stops. Notice that it is possible, and actually very desirable for throughput purposes, for a 
packet to simultaneously be a token (better yet, innovative token) for multiple users. In the context of 
this paper, we introduce the related, but not identical, notion of a "Basis" token, rigorously defined in 
Section IIV-CI which is needed for the proof of the algorithm's correctness and its performance analysis. 
However, we will still use the notion of "innovative" token to gain some intuition into the algorithm. 

An important remark that follows from the previous discussion is that a token for user i may not only 
be non-innovative for i, but it may actually "contain" no packet intended for it, i.e. bi (p) = for all 
p e K%. For example, consider the case where the transmitter sends a packet s = p\, where p\ G K,\ and 
s is received by user 2 only. Using the delta Kronecker (5 m n notation and setting op = (S PiPl : p £ JCi), 
(fP = 0, = and cp = Pi, it is easy to see that s is a token, according to Definition |2j for both 
users 1, 2 and none other. However, packet s contains no packet intended for 2, so that one could deduce 
that this slot was "wasted". Of course, this is not actually the case (i.e. the slot was not really "wasted") 
since user 2 gained some side information, so that the question now becomes how to optimally exploit 
the side information obtained through overhearing. 

A distinctive characteristic of the proposed algorithm is that it efficiently exploits such cases (where 
users receive packets that are of no direct interest to them) by placing the packets into proper queues 
instead of discarding them. This results in better opportunities for efficient packet combinations in the 
future by creating simultaneous innovative tokens for multiple users and essentially compensating for 
previously "wasted" slots. The crux of the algorithm is in the careful bookkeeping required to handle 
these cases in an efficient manner and ensure that all users eventually receive the necessary number of 
innovative tokens. 

The following proposition, which establishes that any linear combination of tokens is a new token (not 
necessarily innovative), will be useful. 

Proposition 1: Consider a set of packets in a queue Q and a set of users S such that each packet 
p G Q is a token for all users i G S. Then, any linear combination s = X^gq a s(p)p of the packets in 
Q is a token for all i 6 5, provided that a s (p) are known to all users i G S. 

The above proposition is easily proved by noting that each packet/token p G Q for user i G S can be 
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Algorithm CODEl pub 



1: initialize Qs, K s for all S C J\f and i G 5 and Qd,, Kn z for all i G A/"; 

2: t <r- 0; 

3: for 1,...,A do 

4: for all (Qs with |«S| = £ do) > arbitrary order of processing 

5: while (K s (t) > for at least one i G 5) do 

6: compute suitable coefficients (a s (p),p G Qs); 

7: transmit packet s = X] P eQ s as (p)P' 

8: a Pply procedure ACTFB1 based on receiver feedback for s; 

9: t<-t + V, 

10: end while 

ll: end for 

12: end for 



Fig. 1. Pseudocode for algorithm CODE1 



pub- 



written as p = Ylmtic bp (u)u + Cp \ whence it follows 



6i 4) (u) 

so that s is still a token for each i G <S. 



B. Description of algorithm CODEl pu b 

Algorithm CODEl pu 6 is succinctly described in pseudocode form in Fig.[T] Specifically, the transmitter 
maintains a network of virtual queues Qs, indexed by the non-empty subsets S of J\f, as well as queues 
denoted as Qd^ for i G N. Fig. |2] provides an illustration for 4 users, where an oval box represents 
Qs, for the corresponding set S appearing as the box label, a square box represents queue Qn. and the 
vertical lines are used to classify the queues into "levels", as will be explained below. The solid (dotted) 
line arrows indicate potential packet movement into a queue Qs (Qd,)- For graphical clarity, Fig. |2] only 
shows the packet movements originating from queues Qm, <2{i,3}; however, similar packet movements 
are allowed for the other queues, as will be explained soon. 
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Ek„ H SI S 




level 3 



Fig. 2. Transmitter side virtual queue network for 4 users. Each oval box represents the queue indexed by the corresponding 
subset of {1, 2, 3, 4}, while solid and dotted line arrows indicate potential packet movement between the queues. 



1 ) Initialization: All Q £>. queues are initially empty while Q$ are initialized with the unicast packets 
as follows: 

{ Ki if £ = {»}, 
Qs(0) = { (5) 
otherwise. 

The performed initialization guarantees that all packets placed in queues Qn\ are tokens for user i G M 
according to Definition [2] 

Additionally, the algorithm keeps track of non-negative integer indices Kp^ K % s . The former are 
associated to queues Qd^ f° r a H * e ^ while the latter are associated to queues Qs, for all S C A/", 
i £ S. The indices A'd i are initialized to for all i € A/", while K l s are initialized as 

Kt\ ifS = {i}, 

(6) 

) otherwise. 

The entities Qs, Qd,, K % s , Kd. will be dynamically updated during the algorithm's execution (depending 
on the exact ACK/NACKs reported by the users), which is why we placed an explicit time dependence 
in ([5]), ©. In fact, the following note on notation will be useful: we write K % s to refer to the index when 
the exact instant at which the index is examined is unimportant (this is akin to using a variable name in a 
programming language: although the contents of the variable may change over time, we can always refer 
to the variable by name). We write K l s {t) when we specifically refer to the value of the index at time 
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t. Furthermore, since the values of these indices depend on the erasures that occur, K % s (t) is actually a 
random variable. We will use the notation Kg{t) (or K s when time is unimportant) when we want to 
emphasize the random nature of the indices. 

For each user i G TV, the algorithm also keeps track of subsets, denoted as Bg (for S C TV with i 6 S, 

(i) 

and Bo t , for i G TV), of coefficient vectors b: s of tokens s for user 2 stored in Qs, Qd; (respectively). 
These coefficient vector sets, which will be seen to have the important property that they can be selected 
so that their union forms a basis for vector space Fg for all i G TV, are initialized as Bd z (0) = for 
i G TV and 

/„•-, standard basis(¥^) if <S = H\, 

4 ?) (0) = { - ' q (7) 

otherwise, 

where the standardjbasis of an |/Q| -dimensional vector space is the set of vectors which have all 
components equal to zero except for the i-th component, which is set to one. 

2) Encoding: We define as "level £" the groups of all queues Qs with \S\ = £. The algorithm operates 
in N phases so that in phase I, with 1 < £ < N, only transmissions of linear combinations of packets 
in one of the queues in level £ occur. Specifically, at phase £, the transmitter orders the queues in level £ 
according to a predetermined rule, known to all users (say, according to lexicographic order of the index 
set S, which corresponds to the top-to-bottom ordering shown in Fig. The transmitter then examines 
the first, according to this order, queue Qs and transmits a packet s that is a linear combination of all 
packets in Qs, i.e. 

s = a s {p)p. (8) 

p&Qs 

We slightly abuse parlance and say that "s is transmitted from Qs", although it is clear that s is not 
actually stored in Qs but is created on-the-fly. Proposition Q] guarantees that s is a token for all users 
i G S, provided that all packets p G Qs are also tokens for all i G S. 

The exact generation method for a s (p) is unimportant as long as two general criteria are met. 

Criterion 1: The procedure for generating a s (p) is known to all users, so that they can always reproduce 
the values of a s (p) even when they don't receive the packet s. This implies that the receivers also know 
the size of all queues Qs, S C TV, at all times. 

Criterion 2: Assume that at the beginning of slot t, there exist (possibly empty) sets of vectors 
BdM ^ { b p ] ■ V e QdM}< for a11 * e AT, and B$\t) C {bf : p G Qx(t)}, for all Z C TV 
and i 6 1, with the following properties (note that for t = 0, these properties automatically hold by 
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selecting B%\o), S A (0) according to ©): 

\Bf{t)\ = K x (t) and \B Di (t)\ = K Di (t), 

&dM u U b t^> is a basis of for a11 * G ^ (9) 

X-.XCM 
Kj(t)>0 

and, for each i G 5 with K l s {t) > 0, we pick an arbitrary hi G <b| • (£)• Then, the generating algorithm for 
a s (p) should return as output any (a s (p) : p G Q$) such that the transmitted packet s = J2 p eQ s a s(p)p 
has a corresponding coefficient vector b$ with the property 

{&W} U B Di (t) U (J - {6J is a basis of Ff'l for all i e 5 with K s (t) > 0. (10) 

l:XCAf 
K*.(t)>0 

The above Criteria should be interpreted as two tests that any generating algorithm should pass, and 
conformance to these criteria is what the term "suitable coefficients" appearing in line 6 of Fig. Q] 
actually means. It is important to note that Criterion [2] is essentially a conditional result: it requires that 
the generator of a s (p) returns an output that satisfies (fTOt provided that there exist sets B% • (i), Su 4 (t) 
that satisfy (O, without making any claims about the actual existence of these sets in the first place. 
It will be shown later (Lemmas [51 [6) that there actually exist sets Bj -(t), BoXt) that satisfy Q and, 
furthermore, there always exist (a s (p) : p G Qs) that satisfy ( fTOb for L > log 2 N. 

Of the two Criteria, the second one is clearly the more difficult to satisfy. It will be shown that if 
coefficients a s (p) are selected so as to satisfy both Criteria, all users in will eventually receive a 
sufficient number of packets to individually solve a linear system that has a full rank matrix w.p. 1. 
Criterion 2 can be relaxed so that the generator of a s (p) returns output that satisfies ( fTOl ) with probability 
arbitrarily close to 1 ; this choice leads to a simple generator for a s (p) based on random selection. Both 
variants of Criterion [2] can be satisfied by choosing a sufficiently large field size q; however, for ease of 
presentation, we only consider the case where ( fTOl is true w.p. 1. 

3) Feedback-based actions: Once the linear combination s, in the form of ([8]), is transmitted from Qs 
at slot t and the transmitter receives the corresponding feedback from all users, the following actions 
(or steps), collectively referred to as ACTFB1, are taken (all 4 cases must be examined, since they are 
not mutually exclusive). We denote with Q the set of users that successfully received s and omit the t 
dependence from all K l s indices. 
ACTFB1 actions: 

1) if no user in Af receives s, it is retransmitted. 

2) if it holds Q C S and Kg = for all i G Q, then s is retransmitted. 
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3) for each user i € S that receives s and satisfies K l s > 0, K l s is decreased by 1 and Krj t is increased 
by 1. 

4) if s has been erased by at least one user i € S and it holds Q n (M — S) / 0, then 

• packet s is added to queue Qsug- 

• for each user i 6 5 that erased s and satisfies K l s > 0, K l s is decreased by 1 and K Su g is 
increased by 1. 

No new coefficients are produced for the retransmissions in steps 1, 2. Fig. [2] presents the permissible 
token movements from queues Qm, Q{i,3} that occur in step|4]of ACTFB1, where, for graphical clarity, 
transitions from the other queues are not shown (dashed lines correspond to step [3] of ACTFB1). Hence, 
a packet s transmitted from Qs can only be moved to a queue Q-r with T D S (a copy of the packet is 
also added to queue Qr) i if s was correctly received by user i). 

4) Algorithm termination and decoding procedure: Processing of Qs (i.e. transmission of linear 
combinations of packets from Qs) continues for as long as there exists at least one i G 5 with K l s > 0. 
When it holds K l s = for all i 6 5, the transmitter moves to the next queue Q$> in level I and repeats 
the above procedure until it has processed all queues in level I. When this occurs, phase £ is complete 
and the algorithm moves to phase £ + 1, where it processes the queues in level £ + 1. 

Since the session length K = (|/Q| : i 6 M) and the exact algorithm for generating coefficients a s (p) 
are known to all users before execution of CODEl pit ;, begins, the presence of public feedback implies 
that, at the end of each slot, all users individually have exactly the same feedback information as the 
transmitter. Hence, they can "replay" the execution of CODEl pu b in real time and iteratively compute 
bg , c\ for each transmitted packet through (0]) so that, by the time CODEl pu b terminates at the end 
of phase N, each user % has received sufficiently many tokens (i.e. packets stored in Qnd to solve the 
related system of equations and decode the packets in /Q. 

C. Properties and correctness of CODElp U b 

The following two Lemmas, proved in Appendices |A] |Bj respectively, contain all important properties 
of CODEl pu b, as they follow from its construction. 

Lemma 5: During the execution of CODEl pu b> the following statements are true: 
1) Any packet s that is stored in a queue Qs at slot t, with |«S| > 2, is a linear combination of all 
packets in queue Qx s (for some non-empty set Z s C S) that has been transmitted at some prior 
slot r < t and received (at slot r) by all users in set S — Z s and erased by all users in set N — S. 
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2) Any packet s stored in queue Q$ can be decomposed as s = Ylueu es IC- ^s(u)u, i.e. packet s is 
effectively a linear combination of packets destined for users in set S only. 

3) Any packet s stored in Q$ is a token for alH G S (and only these i G S). 

4) When transmitting a linear combination s from Q$ at slot t, there always exist coefficients a s (p) 
that satisfy ( fTQb of Criterion |2j provided that there exist sets that satisfy (O and it holds L > log 2 N. 

The relation L > log 2 N will be assumed for the remainder of the paper, so that all subsequent results 
(Theorems, Lemmas etc) are based on this assumption. The following result essentially shows that each 
user i G Af is able to decode its packets by the end of CODEl^'s execution. The result is proved by 
induction, using the algorithm's initialization and the fourth statement in Lemma [5J to establish the crucial 
inductive step. 

Lemma 6: Under the application of CODEl pu b, the following condition is true at the beginning of 
each slot t: there exist vector sets By (t) C {bp : p G Qx(t)}, for all X C Af and i 6 I, and 

BdM Q {bp ] ■ P G QdM}' for a11 * G M> such mat 
. \Bf(t)\ = K*.(t) and |B A (t)| = K Di (t). 
. B Di (t) U |J T: xcM B%\t) is a basis of F^* 1 for all i £ Af. 

Ki(t)>0 

The existence of the above sets motivates the following definition. 

Definition 3: A packet p is called a Basis token for user i G M at slot i iff 6p € BdX^) U 

Ki(t)>0 

Qearly, at the beginning of the slot £ en< 2 immediately after the completion of phase N, Lemma |6| 
implies (since K^-{t en d) = for all i, T) that BoXtend) is a basis of F^ 1 ', for all i G M. Hence, each 
user i has received \Jd\ linearly independent tokens (i.e. Basis tokens) and can decode its packets on 
a one-shot manner by solving the corresponding system of equations, using the Basis tokens in Bd v 
Since this result holds for arbitrary channel statistics, CODEl pu b is, in principle, universally applicable. 
In addition, no prior knowledge of channel statistics is required for its execution. 

D. Some further intuitive remarks 

In retrospect, the combination of Lemmas [51 [6] and their methods of proof give a very intuitive 
explanation to the algorithm's operation, which we provide next. The sets B^\i) contain the vectors 

(i) 

that span the subspace to which the b s vector of any packet s received by user i from queue Q% at slot 
t must belong in order to provide "useful" information to i (i.e. allow i to create an equation, w.r.t. packets 
in set /Q, from the received s that is linearly independent w.r.t all previously created equations by user 
i). This follows from the fact that, for all i G Af, any vector in span(B^\t)) is linearly independent 
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w.r.t. the vectors in Bd z it) (i-e. the space spanned by the coefficient vectors of the tokens already received 

iie- 
by user i), since the union of all these vector sets constitutes a basis of . Similarly, Kj(t) is the 

number of the elements of the basis of F^ 1 that belong to B^\t). 

Furthermore, by the algorithm's construction and Proposition [TJ only the users i G 5 can have Basis 
coefficient vectors corresponding to packets stored in Qs- This is due to Lemma [51 which states that any 
linear combination of packets in Qs contains packets that are intended for users i € S only. Similarly, 
Criterion 2 can be intuitively summarized as follows: when the algorithm processes queue Qs and selects 
a packet s = Yl,p^Q s a s {p)P f° r transmission at slot t, we should select a s (p) such that s is an innovative 
token for all i £ S with K l s (t) > 0, provided that there exist certain sets with specific properties at slot 
t. The existence of these sets is guaranteed again by Lemma [6] 

Regarding the rationale behind ACTFB1, step [3] of ACTFB1 is equivalent to saying that when user i 
receives a "useful" token at slot t (meaning that K l s (t) > so that there remain Basis tokens to receive) 
from Qs, this token should be added to Bd i (with a corresponding increase to Kf) t ), so that it becomes a 
Basis token for user i at slot t + 1. If this is not the case and there exist users, comprising set Q C J\f — S, 
who receive this packet (step|4]of ACTFB1), then the packet has become a token for users in S U Q and 
should be placed in queue Qsug- This allows the token to be simultaneously received by multiple users 
in the future and thus compensate for the current loss. Additionally, since user i can now recover this 
token more efficiently from Qsug instead of Qs, the indices K l s , K Su g should be modified accordingly 
to account for the token transition. Step [2] of ACTFB1 merely states that the packet is retransmitted when 
it is only received by users i who have already recovered from the queue all innovative tokens intended 

(i) 

for them (i.e. B s is empty). 

If K l s becomes at the end of some slot i, queue Qs is no longer useful for user i, since all linearly 
independent combinations that could be created from Qs have either been received by i or stored in 
higher level queues (due to step [4] of ACTFB1) for future recovery by i. Of course, the queue is still 
useful for any other users j € S with K s (i) > 0. 

E. An example of execution of CODElp^ 

We next provide a concrete example of execution for CODEl pil & that illustrates some of the points 
mentioned in Sections |IV-B[ HV-Ci We consider the case of 3 users with 10 packets destined to each 
of them and stored at the transmitter. We denote the sets of packets destined for user 1, 2, 3 as K.\ = 
{ui, . . . , uio}, K,2 = {v±, . . . , v io}, /C3 = {w\, . . . , wio}, respectively. We also introduce an upper index 

(23) 

notation to denote the set of users that have received a packet, e.g. u\ denotes that packet u\ was 
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received by users 2, 3 only. 

The initialization of CODEl pn b is trivial: all packets of set /Q are placed in queue Q{ij, the indices 
are initialized as Th^Oi) = 7| 2 2 |(0) = T| 3 3 j(0) = 10 (all other indices are zero) and the basis sets are 
initialized as £>j^|(0) = B^(0) = 6/31(0) = standard_basis(¥ 10 ) (all other sets are empty). We denote 
with ei the standard basis vector which has its i-th component set to 1. 

CODElp U b executes Phase 1, in which the queues Q{\}, Q{2}, Q{3} are sequentially processed in this 
order. The random erasure events that occur in each slot are shown in Table H where R/E stands for 
Received/Erased, respectively, and X denotes an unimportant value (i.e. X can be either R or E but, in 
any case, does not affect the algorithm's actions). For example, the ERE for slot 2 of Q{i} denotes a 
transmission that was received only by user 2. We also use the following conventions in Table U 

• for simplicity, we omit any slots in which the packet must be retransmitted due to steps [T] [2] of 
ACTFB1. Hence, the slot number (1,2, etc) should not be interpreted as physical time but rather as 
an ordinal indicating slots in which no retransmission was required. In other words, slots 1, 2 need 
not be contiguous in time. 

• due to the imposed order of processing, queues Q{2}> Q{3} are actually processed in slots 1 1-20 and 
21-30, respectively. The reader should interpret the rows corresponding to Q{2}> Q{3} accordingly. 

TABLE I 

CODEl pui) EXECUTION. ERASURES AND QUEUE CONTENTS AT END OF PHASE 1. 



Slot 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Qi 


RXX 


ERE 


ERR 


ERE 


EER 


ERR 


RXX 


EER 


RXX 


ERR 


Q 2 


REE 


EER 


XRX 


REE 


EER 


XRX 


RER 


RER 


XRX 


EER 


Qs 


ERE 


REE 


REE 


XXR 


ERE 


XXR 


ERE 


ERE 


REE 


RRE 



Phase 1 execution 



Queue status at end of phase 1 



Packets decoded by users 



C 1 ) „,« „.« 



1 \ ± j \ ± j 
user 1: u\ , u\ , u. 



r, (2) (2) (2) 

user 2: v$ , Vg , Vg , 



1 (3) (3) 

user y. w\ ,w\ 



Q{i,2} contents 



(2) (2) (1) (1) 



Q{i,3} contents 



(3) (3) (1) (1) (1) 



Q{2,3} contents 



(3) (3) <3) 



1(1 



(2) 



(2) (2) (2) 



Q {1,2,3} 



,(23) ,,(23) ,,(23) ,,(13) ,,(13) ^(12) 



•'lO 



Basis sets at end of phase 1 



B ii,2} = { e 2,e4>, Bf^y = {ei,e 4 }, B^ 3} = {e 5 ,e 8 }, S[^ 3} = {e2,e 3 ,e 9 } 
B{2,3} = { e 2,e 5 ,eio}, B^ s y = {ei, e 5 , e 7 , eg}, Sj^ 2 ,3} = {e3,e G ,e 10 } 
B fi,2,3} = { e 7,e 8 }, &fi >2 ,3} = { ei o} 
Bdi = {ei, er, eg}, Bd 2 = {e3, ee, eg}, Bd 3 = {e4, ee} 
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The transmitter starts processing Qxn and sends the uncoded packet Uj in slot j. Similarly, when 
queues Q{2}, Q{3} are processed, packet vj, Wj is transmitted, respectively, in slot j. This packet selection 
policy complies with Criterion |2 Specifically, if, at slot j, the packet Uj is received by user 1, then its 
corresponding bj vector (i.e. ej) is removed from set B^ x and added to Bb x - If Uj is erased by user 1 
and received by all users in set S, then vector ej is moved from B^X to Bg . Similar actions are taken 
for packets Vj, Wj. 

The queue contents at the end of phase 1 are also shown in Table U Some packets have already been 
decoded by their respective destinations, while the rest have been distributed among the virtual queues. 
The indices at the end of phase 1 are as follows: Th 2 j = T% 2X = 2, Th 3 | = 2, Tfj 3X = 3, T| 2 2 3 | = 3, 
-^{2 3} = ^' 2 3} = 3, In 2 3} = ^ and Tf x 2 3 | = 1. For the sets Bg , the policy of sending uncoded 
packets in phase 1, combined with ACTFB1, implies that Bg{t\) (where t\ denotes the end of phase 1) 
contains the unit basis vectors corresponding to the packets stored in Q$ at the end of the phase. 

The algorithm now executes phase 2, in which the queues Q{i,2}> Q{i,3}> Q{2,3} are sequentially 
processed in this order. Criterion [2] cannot be satisfied by sending uncoded packets only, so the transmitter 
selects a proper linear combination of all packets in the queue currently being processed. Hence, the 
packet s n transmitted at slot n of phase 2 has the form s n = ^2 pe q s a Sn (p)p, where Q$ is the queue 
being processed at slot n and a s (p) satisfy Criterion [2] The erasures that occur in phase 2 and the 
corresponding ACTFB1 actions, as well as their results, are shown in Table HTl (again, the slot number 
should be interpreted as ordinal instead of actual time). 

The first 6 slots of phase 2 illustrate some of the finer points of the algorithm. Specifically, in slot 1 
of phase 2, the transmitted packet s\ is only received by user 1. Since the vector 6^, corresponding to 
packet s±, belongs to the span of the vectors {b^ : p € Q{i,2}}> it follows that b^ S span(B^ 2 x(ti)), 
i.e. bsi £ span(e2, e^). The packets received by user 1 up to now span the space span(BD 1 {ti)) = 
span({ei,e7,eg}), so that s\ brings innovative information for this user. Hence, T} 12 y which counts 
the number of innovative tokens that user 1 has yet to recover from 2}> must be decreased by one. 

In slot 2, the transmitted packet $2 is received by users 2, 3. Using a similar argument as for user 1 

(2) (2) 

in slot 1, we conclude that user 2 gains an innovative token (since b s J £ span{{b p . p € Q{i,2}} = 
span{e\, e^) and Bd 2 = {^3, Be, eg}) and the T% 2X index must be accordingly reduced. It is important 
to note that, since at the time of transmission of S2 it holds 2X > 0, S2 is also an innovative token for 
user 1. Additionally, S2 is a token for users 1, 2 (due to Lemma [5]) and 3 (since user 3 received S2), so it 
is moved to queue (5(1,2,3} • Hence, user 1 can now recover the innovative token corresponding to packet 
S2 from queue (J{i,2,3} instead of Q{i,2}> so that the Tj 12X , T} 123X indices are modified accordingly. 
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TABLE II 

CODElp U b EXECUTION. ERASURES AND QUEUE CONTENTS AT END OF PHASE 2. 



Phase 2 


Processing 


<2{1,2} 


Q{1,3} 


Q{2,3} 


Slot 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Erasure event 


REE 


ERR 


RER 


ERE 


ERE 


ERE 


REE 


ERR 


ERR 


EER 


Applicable ACTFB1 actions 


3 


3,4 


3,4 


4 


4 


4 


4 


3 


3 


3 




T {1,2}' T {1,2} 


J {1,3}' J {1,3} 


rr-i2. nn3 

J {2,3}' i {2,3} 


Index value at end of slot 


1,2 


0,1 


0,0 


1,2 


0,1 


0,0 


2,3 


1,2 


0,1 


0,0 


Queue contents at end of phase 2 


Q{1,2,3} 


(23) (23) (23) (13) (13) (12) (23) (13) (2) (2) (2) (1) 
"3 ' "6 ' "10 ' u 7 ' u 8 ' w 10 ' *2 ' *3 ' 6 4 ' 6 5 ' *6 > *7 


Basis sets for (3(1,2,3} 
at end of phase 2 


^{1,2,3} = {e3, e 6 , eio, bi^ G span(e 2 , e 4 ), bi\\ b^' G span(e 5 ,e$) 
S[ 2 ^ 2 3} = {e 7 ,e s ,b { ^ G span^ , e 4 ) , b'ff G span(e 2 , e 5 , e w )} 
®fi,2-3} = { e io,bil\b^ ,bi 3 J G span(e 2 , e 3 , eg), b^' G span(e!, e 5 , e 7 , e 8 )} 


Packets received by 1 


Ml, W7, llg, Si, S3, S7 


received by 1 


ei, e7, eg, bij G span(e 2 ,en), b{]/ G span(e2,e 4 ), bij = 


Packets received by 2 


«3, ^6, ^9, S2, S4, S 5 , S6, S8, S9 


b' 2 ' received by 2 


l(2) ^ / \ l(2) > (2) , (2) „ 

e 3 , e 6 , eg, b^ 2 ' G span(ei,e 4 ), bi/ = oi B ' = bj 6 J = 
bi^, big 5 G span(e 2 , e 5 ,ei ) 


Packets received by 3 


IU4, U>6, S 2 , S3, S 8 , Sg, S10 


b^ 3 ' received by 3 


e 4 , ee, b^' = 6j|' = 0, b^' , b£g' , G span(ei, e 5 , e 7 , e 8 ) 



Notice that, though S2 becomes a token for user 3, it is not innovative for user 3 since it holds b s . 2 = 0. 
A similar interpretation can be given for the actions in slot 3 by swapping the roles of users 1,2. 

In slots 4, 5, 6, the transmitted packets are only received by user 2, so that step 4 of ACTFB1 is 
applicable and all 3 transmitted packets are moved to Q{i 5 2,3}- By construction of the algorithm, it also 
holds b£,bQ,b£> € span(B^ 3} {h)) . Since, at the be ginning of slot 4, the vectors in <6j^3} span a 
subspace of dimension Th 3 i = 2 (due to Lemma [6]), it follows that bgj, b^ 5 , bgj are linearly dependent 
even though S4, S5, sq € <3{i,2,3}- The last statement clearly demonstrates the true meaning of sets B^(t): 
these sets contain the vectors b corresponding to tokens that remain to be received by user i from queue 
Qs at slot t. It is exactly due to the fact that the packets stored in Qs are not simultaneously innovative 
for all users i G S that the sets must be introduced in the first place. 

At the end of phase 2 (denote this time as t%), the indices for Q{\ t 2,3} are as follows: Th 2 3 | = 6, 
■^fi 2 3j = 4' Tf 1 2 3 i = 5. In phase 3, the transmitter sends linear combinations of all packets stored in 



August 29, 2012 



DRAFT 



20 



TABLE III 

CODElp U b EXECUTION. ERASURES AND QUEUE CONTENTS AT END OF PHASE 3. 



Phase 3 


Processing 


Q{1,2,3} 


Slot 


11 


12 


13 


14 


15 


16 


17 


Erasure event 


RRR 


RER 


RRR 


ERR 


RRE 


RER 


RRE 




■T~>1 rri'2 rjl3 

J {1,2,3} ! J {1,2,3} ! J {1,2,3} 


Index value at end of slot 


5,3,4 


4,3,3 


3,2,2 


3,1,1 


2,0,1 


1,0,0 


0,0,0 


Queue contents at end of phase 3 


Packets received by 1 


Ml, Ur, 119, Si, S3, S7, Sll, Sl2, Sl3, S15, S16, S17 


received by 1 


d, er, eg, bi^bi 1 ^ £ span(e 2 , e 4 ), = 
1,(1) 1.(1) |.(i) h W h W h (i) c e 


Packets received by 2 


V3, Ve, Vg, S2, S4, Ss, S6, S8, S9, Sll, S13, S14, Sis, S17 


fo' 2) received by 2 


e 3 , e 6 , e 9 , bij £ span(ei,e 4 ), b^,b^ € span(e 2 ,e 5 ,e 10 ) 

fe (2) fe (2) ■ (2) ,(2) ,(2) / „ s b (2) b (2)s 


Packets received by 3 


104, IU6, S2, S3, S8, Sg, S10, Sll, S12, S13, S14, S16 


received by 3 


e, er b (3) b (3) b (3) 

fe (3) b (3) b (3) b (3) b (3) svan ( ein b (3) b (3) 1,(3) fa (3) s 



Q {1,2,3} unt il an T indices become zero. Table UTTI shows the erasures that occurred and queue contents at 
the end of phase 3 (note that only step 3 of ACTFB1 is now applicable and the slot numbering in phase 
3 continues from where phase 2 stopped). At the end of phase 3, each user has collected 10 innovative 
tokens (i.e. linearly independent equations) and can decode its packets by solving a linear system. 

V. PERFORMANCE ANALYSIS FOR CODEl pub 

In this Section, we analyze the performance of CODEl pu b for arbitrary channel statistics and conclude 
that CODElp U h achieves the capacity outer bound of Lemma [4] (i.e. achieves capacity), provided that the 
users in M can be ordered according to a specific relation that depends on channel statistics and the 
chosen rates; this provision is shown to be true for the special case of symmetric channels, i.e. channels 
which satisfy the condition e% = ej, for all I, J with |X| = \ J\ (i.e. the probability that all users in set 
X erase a packet is a function of |X| only). 

We also consider the case of spatially independent channels (i.e. ex = ELex 6 *) w i tn (one-sided) 
fairness constraints, a notion first introduced in |fl4l . To define this notion, we assume, without loss of 
generality, that it holds e\ > . . . > and define a rate R to be (one-sided) fair iff it belongs to the 
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set IZfair = {(-Ri) • • • ,Rn) > : ^lRi > • • • > eAf^Af}- We will subsequently show that CODEl pM f, 
achieves any rate R € C out n IZfair, i-e. CODEl pu ;, achieves all achievable fair rates for the BPEC 
channel. 

The complete performance analysis for CODEl pu & is quite lengthy so, for the reader's convenience, 
we present here the main results. 

Theorem 1: Denote f % s = 2_,-Hcs-{i} T=T~ ~ — ^ or a11 5 ^ A/ with i £ S. For arbitrary channel 
statistics, the rate region of CODEl pu &, in information bits per transmitted symbol, is given by 

^codei pu6 = < R > : ^ max (/^) < L \ . (11) 
( SCM ieS J 

Outline of proof : We provide here an outline of the proof with complete details given in Appendix ICl 

Since CODEl pn b, as described in Section ITV-B I is a variable-length coding scheme (i.e. the total number 

of transmissions T* required by the algorithm is a random variable, hence unknown a priori), we propose 

the following modification to make it compatible with a fixed blocklength coding scheme that is required 

by the information-theoretic rate definition of Section [ill For a given rate vector R and fixed n, we 

create, for each user i € N, a set of packets /Q, where |/Q| = Ki{R) = \nRi\, and consider /C, as the 

intended message for user i. We then apply CODEl^ but stop at n transmissions and declare an error 

if CODEl pu 6 has not terminated yet (i.e. an error is declared if T* > n). 

Hence, the modified fixed blocklength code has a probability of error p n (e) = Pr(T* > n) = 

Pr(T*/n > 1); furthermore, using the SLLN, we can show that T* /n tends to a deterministic quantity 

T*(R) (the R dependence is due to the fact that T* implicitly depends on K = \nR~\) w.p. 1 as n —> oo. 

Hence, the information-theoretic rate region achieved by CODEl pu ;, is the set of rates R, measured in 

information symbols per transmission, for which p n (e) — > as n — > oo, which is intuitively equal to 

{R : T*(R) < 1}. To compute the rate region in information bits per transmission, we use the fact 

that each symbol contains L bits and T*(R) is a homogeneous function of degree 1 with respect to its 

argument (i.e. T*(aR) = aT*(R) for any a > 0). Appendix O provides a detailed calculation of T*(R) 

and makes the above argument rigorous. ■ 

In order to provide a general optimality criterion for CODEl^* we need to define the following set. 

TZord = \ R > : 3 permutation n s.t. V<S C J\f it holds argmax (fsRi) = argmin (7f(i)) \ ■ (12) 

Although the permutation tt in (fT2l may implicitly depend on R (as well as on channel statistics through 
f l s ) and should actually be written as ttj^, we opt to simplify the notation by henceforth omitting this 
dependence. In words, 1Z or d contains all rates R, whose indices can be rearranged according to tt so that 
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the relation in (fT2l is satisfied. Notice that lZ or d is a cone set, i.e. R € 7£ orc i implies ai? € 7£ or d for all 
a > 0. Hence, as long as there exists some non-zero <E lZ or d, the set 7£ rd intersects the boundary of 



where tt is the permutation corresponding to R € TZord via (fl2l) . we prove the following result in 
Appendix [D) 

Lemma 7: If R £ TZ or d, it holds 



This implies, through Theorem [T] that 7£ C odei b n 7£ or d = ^ • 

Theorem Q] and Lemma [7] now lead to the main optimality criterion. 

Theorem 2: The rate region of CODEl^ satisfies the relation 7^ C odei pu6 H 7£ or d = C oui n 7£ orc z = 2? 
(i.e. CODElp U ft achieves any achievable rate in TZ or d)- Therefore, if it holds lZ or d 2 C out , the rate region 
of CODElp U fe satisfies the relation 1Z C0DKlpub = C out = V, i.e. CODEl^ achieves capacity. 
More details are provided in Appendices [Dl Theorem [2] implies the following result (whose proof is 
given in Appendix 10) regarding the optimality of CODEl pu 6- 

Theorem 3: The set lZ or d satisfies the following relations: 1) lZ or d = {R '■ R > 0}, for symmetric 
channels and 2) 1Z or d 5 TZfair for spatially independent one-sided fair channels which satisfy the 
condition e± > . . . > ejv- Hence, CODEl pu f, achieves capacity for symmetric channels and also achieves 
all rates in TZf a i r D C out for spatially independent channels. 

A. Incorporation of overhead 

The previous analysis rests on two assumptions: 1) public feedback is instantaneously available to all 

(i) (i) 

users, and 2) each user i € N always knows the values of b s , c s for any packet s it receives. In order 
to remove the former assumption (so that each user need only know its own feedback), and still satisfy 
the latter requirement, the feedback information must be conveyed to the receivers by the transmitter 
at the expense of achievable rate (i.e. incorporation of overhead). In fact, the second requirement is 
equivalent to the requirement that all users know the coefficients a s (p) of any generated packet s, even if 
they don't receive it. This follows from the fact that all are iteratively computed, through I®, based 
on the selected a s (p). Hence, the second requirement is satisfied if the algorithm for generating a s (p) 



Introducing the subset of lZ or d 




(13) 




(14) 
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(see the final remarks in the proof of Lemma [5]) is available at each receiver. This eliminates the need 
for appending the coefficient vector into the packet header as was originally proposed in [221 ■ We next 
describe a simple, not necessarily optimal, overhead scheme that can be applied to the original algorithm 
CODElput, (with or without the fixed blocklength modification) and leads to a new algorithm, named 
CODEl pr j, which does not require public feedback. The latter algorithm consists of two stages, called 
"pure information transmission" and "feedback recovery", as is explained next. 

During the pure information transmission stage, a single overhead bit h\ is reserved in each packet of 
length L. Hence, the information payload contains L — l bits and the linear combinations are performed 
only over the information payload (i.e. we treat the sequence of L — 1 bits as an element of ¥ q ). The 
transmitter executes CODEl pil b normalljQ, by setting hi = in each transmitted packet and taking 
the received feedback into account according to ACTFB1. For each transmitted linear combination s 
(including retransmissions due to steps l,|2]of ACTFB1), the transmitter also creates an iV-bit group 
(/i, . . . , /jv), where is 1 or 0, depending on whether or not user i received s, and stores it into a 
feedback log. Denoting with T* the (random) number of time slots required by CODEl^ to process all 
queues, an equal number of iV-bit groups is created and added to the feedback log. Meanwhile, each user 
stores the packets it receives in a single queue in a FIFO manner since, at this point, it can do nothing 
more without additional information on the other users' feedback. 

In principle, if each user learns the exact feedback log, it will gain the same information it would have 
in the case of public feedback; hence, it can "replay" the algorithm as it was executed at the transmitter 
side and deduce the values of Cg for the packets s it received. Hence, the objective now becomes 
to multicast the feedback log to all N users in a manner that does not introduce significant overhead. 
This is performed in the second stage of feedback recovery, in which 2 overhead bits hi, hi are reserved 
for each packet. When CODEl pu b terminates (i.e. phase N is complete), the transmitter splits the entire 
feedback log into packets of length L (so that a total of \NT* / (L — 2)] packets is required, considering 
the 2 bit overhead per packet; we hereafter call these "feedback" packets) and broadcasts each feedback 

'based on the algorithm's description in Section HV-BI the reader will notice that the existence of public feedback may affect 
the exact decoding procedure at each user but does not affect the transmitter's actions in any way, since the latter always has 
access to feedback from all users. 
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packet until it is received by all usersj^ Notice that a single feedback packet actually contains the exact 
feedback that occurred in a group of [(L — 2)/N\ consecutive slots. 

Each feedback packet has its h\ bit set to 1, so any user that receives it can distinguish it from "pure" 
information packets (which had h\ = 0) received during the previous phases of CODEl pu ;,. Furthermore, 
the transmitter applies the following procedure for bit h%. The first transmitted feedback packet has h,2 = 0. 
The transmitter keeps sending this packet until all users receive it. When this occurs, the transmitter sends 
the next feedback packet by flipping the I12 bit. 

The flipping of the h% bit is necessary to guard against the following case: if a feedback packet is not 
received by all users upon its first transmission, it is retransmitted so that it is possible that a user may 
receive multiple copies of a "single" feedback packet (meaning that all these packets contain feedback 
for the same group of slots). Without any additional provisioning, this user cannot distinguish this case 
from the case of multiple feedback packets that occurred in contiguous groups of slots and happened 
to experience exactly the same erasures. This problem is solved by enforcing the rule of flipping hi 
between transmission of feedback packets that correspond to different groups of slots during the N 
phases of CODEl pu fc. 

After all log packets have been successfully received, the transmitter broadcasts a final packet with 
all bits (including hi, ^2) set to until it is also received by all users. This packet, which can be 
easily distinguished by previous feedback log packets since it differs in the h\ bit, informs the receivers 
that transmission of all relevant information is complete. The entire overhead scheme is pictorially 
demonstrated in Fig. [3] 

Assuming the order of processing Qg to be known a priori, each receiver can actually "replay" the 
execution of CODEl pu f„ up to the point for which it has received the corresponding part of the feedback 
log, since it can reproduce the coefficients a s (p) using the same coefficient generation procedure and 
linear independence checking procedure (see discussion at the end of the proof of Lemma [5] in the 
Appendix) as the transmitter. Hence, the receiver can create local copies of the transmitter side queues 
Qs and counters K l s and use d4j) to iteratively compute the by, cy values of each transmitted packet 
s. The FIFO manner of storing packets at the receiver is crucial, since it associates each received packet 
to the correct ACK/NACK group. The following result now follows from Theorem [TJ 

2 it is not necessary that any feedback packet is successfully received by all users simultaneously. During the transmission of 
the feedback log, the transmitter keeps track of which users receive a feedback packet, say by raising a flag whenever a user 
receives a packet. Hence, the transmitter need transmit a single feedback packet only until the flags for all users have been 
raised, at which point it starts transmitting the next feedback packet (resetting all flags). 
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Fig. 3. Distinguishing packets at the receivers based on overhead bits. 



Theorem 4: Under the overhead scheme described above, the rate region of CODEl pr j, measured in 
information bits per transmission, for arbitrary channel statistics satisfies the following relation 



il : £ max (Rif s ) < - L J 2 \ , (15) 

SCAf * G6 1 + (L-2)(l- £m „) 



where e max = max ie ^ a. 

7^coDEi prj approximates 7^coDEi pilb within 1 bit as L — > oo, so that the overhead-induced rate loss is 
minimal. An an example, for N = 10 and e max = 0.5 (the latter represents very poor channel conditions; 
tmax is typically much smaller), a length of L = 8000 bits leads to a rate loss of 2.5% w.r.t. 7^ C odei PU [,- 
Proof: The proof is similar to the proof of Theorem Q] for the case of public feedback, with the 
important difference that we must now also take into account the number of slots required for the 
transmission of the feedback log to all users. Based on the description of CODEl pr j, the total number of 
slots T** needed by this algorithm is 

l+[JVf*/(i-2)l 

t** = t* + ^> ( 16 ) 

i=i 
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where the first part in the above sum (i.e. f*) is the number of slots required by CODEl pu & and the second 
part is the total number of slots required to transmit the packetized feedback log (i.e. 1 + \NT* / (L — 2)] 
packets, including the termination packet), where Ni = max ie _A/ Nij, with N^i the (random) number 
of transmissions required until the Z-th feedback packet is received by user i. It is clear that Nn are 



geometrically distributed with Pr(iVj i = v) 
The following relations will also be useful. 



v-l 



(1— £j) while N\ are (temporally) iid random variables. 



Pr(iV, > u) = Pr ( (J {N hl > u} ) < £ Pr(iV M > u) = £ < _1 < N<%£, 



= x: ?m >^)<E Ne ™* = j 

Rewriting (fT6l) as 



N 



(17) 



i/=i 



i/=i 



T** f* 1 + \NT*/(L - 2)1 



+ 



n 



n 



1 



l+[JVT7(i-2)] 

E 

Z=l 



(18) 



1 + \NT*/(L - 2)] 

and using (1651 ) of Appendix ICl for the asymptotic behavior of T* /n as n — > oo, and the fact that T* — > oo 
w.p. 1 as n -4 oo, so that we can invoke the SLLN for the term inside brackets, we conclude that 

iV 2 



f**(R) = lim 

n— >co n 



lim — < 

n— >oo n 



1 + 



(L — 2)(1 — e max ) 



J^SCAf % 

(19) 



where we used (fT71) in the last inequality of the above expression. We can now apply verbatim the 
argument used in Appendix ICl (Section IC-Bb to show that the achievable rate region of CODEl pr j, in 
information symbols per transmission, is 

iV 2 



K 



CODEl 



>ri = {r > : f**(R) < l| D < R > : 



1 + 



(L-2)(l-e max ) 



(20) 



where the last set inequality is due to (QUI) . Eq. (fl"5l) follows immediately by noting that each transmitted 
packet in the pure information transmission phase (the feedback packets, although necessary for decoding, 
only carry feedback information that is independent from the actual message) has an information payload 
of L - 1 bits. ■ 



VI. Achieving capacity for 3 users and arbitrary channel statistics 

Although CODElp U b achieves the capacity outer bound of Lemma [4] for some channel statistics (namely, 
those that satisfy condition 1Z orc i 5 C out in Theorem [2), this is not always true, i.e. for certain channel 
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statistics there exist rates R £ C out that are not achievable by CODEl pu f,. This is easily verified for 3 
users as follows: consider the case of equal rates, i.e. Ri = R for all i £ {1,2,3} (which implies that 
\Ki\ = K for all i), and assume that it holds 

ei = e 2 = e 3 , 

(21) 

£{1,2} > £{1,3} > e {2,3}- 

Considering all possible permutations on {1,2,3} and applying Lemma @] yields the following bound 

C^ = !r1:r(-^— + -^ + — -i ) <l|. (22) 

I \l-ei l-£{i,2} 1-^1,2,3}/ J 

Applying (fTTT ) of Theorem Q] to the case of equal rates and using (l2Tb produces, after some algebra, 

K eq ,consi pub = {r1:R (j^— + + — ^ ) < l) , (23) 

I \l-ei 1 - £{1,2} 1 - e {2,3} i - e {l,2,3}/ J 

which implies, since — ^ 2} > - , that 7Z e q,coDEi pub C C™ 1 . This demonstrates the suboptimality 

of CODElp^. 

A more intuitive explanation for the suboptimal performance of CODEl pu b under asymmetric channel 
statistics for the 3-receiver case can also be given through the following argument (note that, for N = 3, 
the network corresponding to Fig.|2]contains only queues for sets S G {{1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, 
{1, 2, 3}}, in addition to Qd x , Qd 2 > Qd 3 )- Assume that in phase 2 of CODEl pu b, the order in which the 
queues are processed is {1, 2}, {1, 3}, {2, 3}. When the transmitter sends linear combinations of packets 
from Q{i,2}> it is quite possible that the indices Kh 2 y K?^ 2 j do not become zero simultaneously. Say 
it happens that, at some slot t, it holds Kh^At) = and K? 12 ^(t) > 0. By construction, CODEl pu b 
will continue to transmit linear combinations from Q{i,2} unt il K% 2 i also becomes 0. However, this 
introduces a degree of inefficiency, as evidenced in step [2] of ACTFB1. 

Specifically, if a transmitted packet s is only received by user 1 , step |2] will force s to be retransmitted 
until some user other than 1 receives it, essentially "wasting" this slot. We claim that there exists potential 
for improvement at this point, by mixing the packets in Q{i, 2 } with the packets in Q/i 2 3}- Clearly, the 
first two statements in Lemma [5] are still true, so that each packet stored in either <5{i, 2 } or Q[\^;i} is a 
token for both users 1,2. Combining this fact with Proposition [T] any linear combination s of the packets 
m Q{i,2}> Q{1,2,3} is a token. In fact, since it will be later shown that it is still possible to define sets 
B^(t), B Di {t) so that Lemma [6] holds, a proper selection of a s (p) allows s to become a Basis token, 
in the next slot, for both 1,2 (provided that it holds Kh 2 3} > 0). Hence, even if the packet is received 
only by 1, the slot is not wasted, since 1 recovers a Basis token. 
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Unfortunately, the previous reasoning implies that the rule of always combining packets from a single 
queue must be discarded if the objective is to achieve capacity. For N > 3, it is not even clear what 
structure a capacity achieving algorithm should have. However, for N = 3, we present the following 
algorithm, named CODE2 pit ;„ which achieves capacity for arbitrary channels, assuming public feedback 
is available. 

CODE2 pu f, operates in phases as follows. Phase 1 of CODE2 pu {, is identical to phase 1 of CODEl pu &, 
with the transmitter acting according to the rules in ACTFB1 (note that step|2]of ACTFB1 cannot occur in 
this phase of CODE2 pu ;,). In phase 2 of CODE2 pn 6, the transmitter orders the level 2 queues Qg according 
to an arbitrary rule and sequentially processes each Q$ by transmitting linear combinations from Q$ until 
it holds K l s = for at least one user i G S. When this occurs, the transmitter moves to the next level 2 
queue. Again, the steps in ACTFB1 are applied. When all level 2 queues have been processed, each such 
queue Q$ has at most one surviving user index (meaning some i £ S with K l s > 0). For convenience, 
we denote this time instant with t\ and define the survival number Su(i) of index i G {1,2,3} as 
Su(i) = \{S : \S\ = 2, K^t^) > 0}|. In words, Su(i) is equal to the number of level 2 queues which 
contain unrecovered Basis tokens for user i at time t%. Clearly, Su(i) is a random variable that depends 
on the prior erasure events (hence, the dot accent) and satisfies < Su(i) < 2 for all i G {1,2,3}. The 
transmitter now distinguishes cases as follows: 

1) if it holds Su(i) = for all i G {1,2,3}, CODE2 pub reverts to CODEl pufe , starting at phase 3. 

2) if it holds Su(i) = 1 for all i E {1, 2, 3}, CODE2 pu f, reverts to CODEl pu f, and continues processing 
each Qg queue in level 2 until all K l s become zero. 

3) otherwise, there exists at least one pair of users i,j such that Su(i) = 0, Su(j) > 0. In this case, 
simple enumeration reveals that all possible configurations of Su(l) for I € {1, 2, 3} fall in exactly 
one of the following 4 categories: 

a) there exist distinct users i*,j*, k* G {1, 2, 3} such that Su(i*) = 0, Su(j*) = 1, Su(k*) = 2. 

b) there exist distinct users i*,j*,k* G {1,2,3} such that Su(i*) = 0, Su(j*) = Su(k*) = 1. 

c) there exist distinct users i*,j*,k* G {1, 2, 3} such that Su(i*) = Su(j*) = and Su(k*) = 2. 

d) there exist distinct users i*,j*,k* G {1, 2, 3} such that Su(i*) = Su(j*) = and Su(k*) = 1. 
To provide some concrete examples, Fig. [4] contains 4 possible configurations (each belonging, from 
left to right, to one of the above categories), where circles are used to denote surviving indices. 
The values (i*, j* , k*) for each configuration are (3,2,1), (2,1,3), (3,2,1), (3, 2, 1), respectively. 

We hereafter concentrate on case 3 of the above list, since cases 1, 2 revert to CODEl ptt fc. The transmitter 
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Fig. 4. Possible states of innovative token indices Kg for the level 2 queues at epoch t\. 

now constructs the set Q^ u = {Q{i*j} '■ Su(i*) = 0, Kj^ -t.^) > 0} consisting of all level 2 queues that 
contain a surviving index j and an index i* with Su(i*) = 0. Relative order within is unimportant. 
A subphase, called 2.1, is now initiated, in which the following actions are performed: 

• the transmitter processes each queue Q{i*.j} in Qg u and transmits a packet s which is a linear 
combination of all packets in queues Qn*j\ and Q{\ t 2,S) ("and" denotes grouping in this context 
and should not be interpreted in the Boolean sense). The coefficients a s (p) are selected such that s 
is a Basis token for j as well as i* (for the latter case, this is true if it holds K'C 2 3 | > 0). It will 
be proved in Appendix |F] that this selection is always possible. Depending on the received feedback, 
the following actions, collectively referred to as ACTFB2, are taken. 

ACTFB2 actions: 

1) if s is erased by all users, s is retransmitted. 

2) if s is received only by i* when it holds K'C 2 3 , = 0, s is retransmitted. 

3) if j receives s, K-L, is decreased by 1 and Kdj is increased by 1. 

4) if i* receives s and it holds Kf x 2 3 , > 0, K 1 ^ 2 3 x is decreased by 1 and Ko* is increased by 
1. 

5) if j erases s and k € {1, 2, 3} — {i*,j} receives it, s is added to Q{i t 2,3}> K 1 ^, ^ is decreased 
by 1 and K-L 2 3 j is increased by 1 . 

Notice that, apart from stepH]) in the above list, ACTFB2 is similar to ACTFB1. The above procedure 
is repeated until it holds Kl^ = 0, at which point the next queue in Qg u is processed. The above 
procedure is repeated until all queues in Q^ u have been processed. 

• once all queues in Q^ u have been processed, the transmitter computes the new values of Su{i) for 
i G {1,2,3} and constructs Qg u from scratch. If Q^ u = 0, CODE2 pu 6 reverts to CODEl pn b starting 
at phase 3, otherwise it repeats the above procedure verbatim for the new Qd u - It can be easily 
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verified that at most 2 iterations of this procedure will be performed until it holds Qg u = 0. 

As a final comment, step |2] of ACTFB2 is similar to step |2] of ACTFB1 so one could argue that 
CODE2 pu b still performs inefficiently. However, by construction of Qg u , it is easy to verify that if, 
during the combination of Q{i*,j} £ Qg u with (5(1,2,3}, K 1 ^ 2 3 j becomes before K J ^, does, then i* 
has no more Basis tokens to recover (i.e. it holds K % s = for all S C J\f). Hence, i* cannot gain any 
more linearly independent tokens by combining Qii*j\ with <5{i,2,3} and no efficiency is lost. 

To provide a concrete example for the last statement, consider the application of subphase 2.1 to the 
leftmost configuration in Fig. |4] It holds Q^ u = {Q {1,3}, Q {2,3}} and the transmitter starts combining 
Q{i,3} w i m Q {1,2,3} unt il K% 3 | becomes 0. If it happens that 2 3 -. becomes before K^ 2 3 y, then 3 
has indeed recovered all Basis tokens so that, even if step [2] occurs, no efficiency gain is possible. The 
same conclusion is reached by examining the 3 other categories shown in Fig. |4] Hence, at the end of 
subphase 2.1, it holds K l s = for all i £ S with |<S| = 2 and CODE2 pit ft reverts to CODEl pM h starting at 
phase 3. 

The properties and achievable rate region of CODE2 pM ;, can be determined by an approach similar to 
that of CODElp,^. Specifically, the correctness of CODE2 pn h is proved in Appendix [0 where a slight 
modification of Lemma [5] is used to show that Lemma [6] is still true for CODE2 pu {,. This guarantees that 
at the end of CODE2 pn b, all 3 users have received the required number of linearly independent tokens 
and can decode their packets. The performance analysis for CODE2 pn b is identical to CODEl pu b, up to 
time t\. From this point on, the number of tokens produced during the combination of the queues in Q^ u 
with (5(1,2,3} must be carefully computed. The computation is relatively straightforward but lengthy, and 
is deferred to Appendix iGl The final result is: 

Theorem 5: CODE2 pM ;, achieves the capacity outer bound of C out , for L > 2. In case only private 
feedback is available, we can construct algorithm CODE2 pr j, based on CODE2 pn b, using the overhead 
scheme employed in CODEl pr j. The final result is that the rate region of CODE2 pr ., asymptotically differs 
from the capacity outer bound by 1 bit as L — > 00. 

VII. Conclusions 

This paper presented 2 coding algorithms, CODEl^ and CODE2 pit ;„ which achieve (assuming public 
feedback) an outer bound of the feedback capacity region of the Abuser broadcast erasure channel with 
N unicast sessions for the following cases, respectively: 1) arbitrary N and channel statistics that satisfy 
the general condition in Theorem [2] (this includes symmetric channels as a special case), and 2) arbitrary 
channel statistics, for N = 3. If public feedback is not available, a simple overhead scheme can be applied 
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on top of each algorithm, leading to a rate region that asymptotically differs from the outer bound by 1 
bit as L — > oo. The main characteristic of the algorithms is the introduction of virtual queues to store 
packets, depending on received feedback, and the appropriate mixing of the packets, without requiring 
any knowledge of channel statistics, to allow for simultaneous reception of innovative packets by multiple 
users. 

Since only an outer bound to the capacity region is known for N > 4 and arbitrary channels, the 
search for capacity achieving algorithms for N > 4 is an obvious future research topic. It is expected 
that such algorithms cannot be constructed through minor modifications of CODEl pu 6, as was the case with 
CODE2 pu 6, and may possibly require complete knowledge of channel statistics. If this is the case, adaptive 
algorithms that essentially "learn" the relevant statistics may be appropriate. Suboptimal algorithms with 
guaranteed performance bounds in the spirit of |[T3l may also be of interest. 



By construction of CODEl pu b, the only way a packet s can be stored in queue Qs, with |<S| > 2, is 
during step|4]of ACTFB1 (since, excluding packets that are received by a user i £ S and moved to queue 
QDi, no packets are moved between queues Qs in the other steps of ACTFB1). Thus, the execution of 
step |4] implies that s is a linear combination of packets in some queue Qx s , with ^ Z s C S, and s is 
received by all users in S — l s and erased by all users in N — S. This completes the proof of the first 
statement. 

For the second statement of the Lemma, we note that the algorithm's operation implies that any 
transmitted packet s is decomposed as s = Sueu ejV AC cl s (u)u (the algorithm essentially sends linear 
combinations of linear combinations etc.). Furthermore, we can combine the initialization of CODEl pM h 
(for queues Qs with |<S| = 1) with the first statement in Lemma [5] (proved in the previous paragraph) to 
show, via strong induction on |«S| = 2, . . . , N, that, for all S C M and any packet s stored in Qs, it holds 
a s (u) = for all u G K.j with j S. Specifically, any s € Qs must have entered Qs during step [4] of 
ACTFB1, so that it holds s = ^2„ £ q x a s (p)p, where Z s C S. Using the strong induction hypothesis for 
I s , we know that any p £ Q% s is written as p = Ylueu- ex ^p( u ) u - Combining the last two expressions, 
we conclude that any packet s stored in Qs can be written as 



Appendix A 



Proof of Lemma [5] 




(24) 



for suitable a s (u), and the second statement is also proved. 
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To prove the third statement of the Lemma, we apply strong induction on \S\, starting with |<S| = 1. 
Due to the initialization of CODEl pil ;„ any packet s stored in Q^y belongs to set /Q, so that s is a 
(trivial) token for user i and no other user. We now consider any s stored in queue Q$ with |«S| > 1, and 
use the first statement of the Lemma to write s = 2~2 p <=q x a s(p)P> where X s C S. This also implies that 
s was received by all users in set S — X s , so that s is a token for all users in the set S — Z s . Combining 
the inductive hypothesis for set X s with Proposition [T] we conclude that s is a token for all i G I s as 
well, so that s is a token for all i G S. To show that s is not a token for any i £ S, we combine the fact 
that s is a linear combination of packets destined for users in set Uj g x a /Cj only (second statement of the 
Lemma) with the fact that s was erased by all users i ^ S (first statement of Lemma). Hence, i cannot 
be a token for any i g" S. 

Before we prove the fourth statement in Lemma [5J we need to establish some intermediate results. 
The following Proposition is easily proved by considering the union bound for the probabilities of the 
complementary events. 

Proposition 2: For any events Aj, with j = 1, . . . , m, it holds 

m 

Pr(n^ 1 A i ) > Pr (^j) - m + 1. 

3=1 

The following result will be crucial in proving Lemma [5] 

Lemma 8: Let {vi, . . . ,Vm} be a basis set of the vector space F^ and consider a subspace U with 
dimension I > 1, which contains the set {v±, . . . ,vk}, with 1 < K < I. Then, the subspace U n 
span({v2, ■ ■ ■ , vm}) has dimension at most I — 1, and U — span({v2, ■ ■ ■ ,Vm} is a non-empty set. 
Additionally, for any vector u <EU — span({v2, ■ ■ ■ , »m}, the set {u, V2, ■ ■ ■ , vm} is a basis of . 

Proof: We use contradiction to show that dim(U D span({v2, . . . , v m}) < I — 1. Specifically, 
assume that dim(U n span({v2, ■ ■ ■ , vm}) = I- Then there exists a set {vi, . . . , v{\ which forms a basis 
of U nspan({v2, ■ ■ ■ ,vm})- Therefore, {vi, . . . ,i>i} C Unspan({v2, ■ ■ ■ ,«m}) is a basis of U as well, 
since it is a linearly independent set of cardinality I that is contained in the subspace U of dimension I. 
The basis property for U now implies that v\ € span({v\, . . . , vi}) and, since Vi € span({v2, ■ ■ ■ , «m}) 
for 1 < i < I, it also holds v\ G span({v2, ■ ■ ■ , This contradicts the assumption that {vx, . . . , By} 

are linearly independent and proves the desired result. Additionally, since v\ span({v2, . . . ,%}), it 
also holds Xv\ G U — span({v2, . . . , ^m}) for all A G F g — {0}, so that U — span({v2, . . . , i>m}) / 0. 

In order to show that {«, t>2, . . . , «a/} is a basis of F^ f for anj u G U—span({v2, ■ ■ ■ , i>m}), it suffices 
to show that {it, i>2, • • • , i>m} is a linearly independent set. Indeed, pick any it G U—span({v2, . . . , «m}) 
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and assume that there exist a, Ci <E F q such that 



M 
i=2 



+ 22 C i V i = °- 



Then it must hold a = 0, since the case a ^ implies that u € span({v2, • • • , «m}). which is impossible 
by the selection of u. The condition a = now implies q = 0, due to the linear independence of 
{i>2, • • • , vm}, so that {u, i>2, • • • , vm} is also linearly independent and the proof is complete. ■ 
The last intermediate result we need before proving the fourth statement in Lemma [5] is provided 
below. 

Lemma 9: Let vj, with j = 1, . . . , k, be vectors in F q ! . Denote V = span({vj},j = l,...,k) and 
/ = dim(V), with / > 1. Let ay, with j = 1, . . . , k, be independent random variables uniformly distributed 
in F q and construct the random vector v = Ylj=i a j v j- Then, v is uniformly distributed in V, i.e. 

p r ( u = e ) = VeeV. 

<r 

Additionally, let {6 1( . . . , b M } be a basis of Fff and assume that {bi, . . . , 6^} C V for 1 < K < M. 
It then holds 

Pr ({v, b 2 ,..., b AI } is basis of Fj J ) > 1 - ^ . 

Proof: Since V has dimension I, we can pick / vectors vi (out of the k available) as a basis for V; 
without loss of generality, we can permute vector indices so that the basis set is {v\, . . . ,v{\. Hence, 
v can be written as v = Y?j=i a j v j + 9> where g = Yl!j=i+i a j v j * s a random vector independent 
from Y^j=i a j v j- Furthermore, any vector e e V can be written uniquely, through the basis set, as 
e = Y?j=i e j v j- I* now holds 

Pv(v = e) = Pr a j v j + 9 = e 

rev \j=i 

= 12 Pr \Y1 a i v i = e ~ r 

rev \j=i / (25) 

= 2 Pr X! W = Zl( e J " r i) v i Pr (# = r) 
rev \j=i j=i J 

1 1 

= J2 Pr ( n j=i{«i = e J - r i}) Pr (» = r ) = J2 - Pr ^ = r ) = -' 

rev rev q q 

where we used the independence of Y^!j=i a j v j from 9 to remove the conditional probability and exploited 
the facts that {v±, . . . , vi} is a basis set for V and <x, are independent and uniformly distributed in F q . 
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To prove the second part, we note that 

Pr ({«, 62, ... , b M } basis of Ff ) = 1 - Pr(u G span({6 2 , . . . , b M }))- (26) 

For notational convenience, denote M. = span ({62, • • • , &m}). It now holds 

Pt(v eM)= Pl '( v = r) = -\Mr\V\, (27) 

rexnv q 

where the last equality is due to the uniform distribution of v in V. For all vector spaces over a finite field, 
it also holds |A4nV| = q dim (Mnv) < g cMV)-i = q f J-i > w here the inequality is due to LemmaHJ Inserting 
this inequality into d27l ) produces Pv(v G A4) <l/q, whence the desired result follows immediately. ■ 
We are now in position to prove the fourth statement of Lemma [5] Specifically, recalling the notation 

of Criterion |2j we assume that there exist sets B% (t), B^Xt) such that B = Bu^t) U [Jz:ZC/f B x 1 (t) 

ifi(t)>o 

is a basis of F^ 1 ' for all i G M. Assuming that CODEl pu ;, is currently processing Qg, define the set 
TZs(t) = [i G S : K s (t) > 0}. We need to show that if, for each i G 1Zs(t), we pick an arbitrary 
vector hi G B$ (t), then there exists a coefficient vector a s = (a s (p),p G Qs) such that the vectors 
bi = YlpeQs a s{p)bp \ corresponding to the combination s = Yl p eQ s a s{p)p, satisfy the following 
condition 

{bf}U% w U \J Bf(t) - {bi} is basis of F^ 1 Vz G K s (t). (28) 

Z.-ZCJV" 
Ki(t)>0 

The proof is via a standard probabilistic argument. Specifically, consider the case where coefficients 
a s are iid randomly generated according to a uniform distribution in ¥ q . For a given user i G lZs{t), 
define the event A { = {{#} US - {6<} is basis of Ff " ' I , whence it follows from Lemma [9] that 
Pr(Aj) > 1 — 1/g. Applying Proposition [2] to the event r\ e -ft s m A, yields 

(n ie ^ sW A t ) > |tt 5 (t)| (1 - J) - l^(*)l + 1 > 1 - - 1 " 7' (29) 

Selecting q > N (since q can be as large as 2 L , the condition q > N can be satisfied if L > log 2 iV) 
results in a strictly positive probability, which implies that there exist some vectors that simultaneously 
satisfy (l28l) for all i G TZs(t). This completes the proof of the fourth statement in Lemma [5] 

The previous analysis suggests the following alternative approach to an exhaustive search for generating 
a s (p) in accordance with Criterion [2] If the sets B%\ Br>i are actually stored at the transmitter and 
receivers, and since Lemma [5] ensures that, for q > N, there exist coefficients a s (p) which satisfy ( fTOb 
of Criterion [2j then a s (p) can be generated randomly and uniformly in ¥ q (so that d29l) holds) followed 
by an explicit check by the transmitter whether the generated vectors b^ indeed satisfy d28l ). If (|28T ) is 
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violated for at least one i 6 TZs(t), new coefficients are repeatedly created until the condition is satisfied 
for all i G TZs(t). Only then is the packet s actually transmitted, using the most recent coefficients a s (jp) . 
The average number of trials required to find the suitable coefficients is easily computed as (1 — N/q)" 1 . 

Appendix B 
Proof of Lemma [6] 

Proof is by induction on t. At the beginning of slot t = 0, we can satisfy all conditions by choosing for 
each i € AT as follows: B D M = > B^(0) = for 1 ^ {i} and B^(0) = standard_basis (f^'Y 
We now assume that the inductive hypothesis is true at the beginning of slot t and the queue currently 
being processed is Qs- We construct TZs{t) = {i £ S : K l s {t) > 0} and further assume w.l.o.g. that 
7ts(t) 7^ since, in the opposite case, CODEl pu 6 will skip processing Qs and continue to the next 
queue. Lemma [5] now guarantees that, due to the validity of the hypothesis (i.e. the existence of B^{t), 
Bf) at the be ginning of slot t, we can select vectors hi S Bg\t), for each i € lZs(t), and coefficients 
a s (p) for the next packet s to be transmitted from Qs so that (fTOb of Criterion [2] is satisfied for all 

* e n s {t). 

For each i € TV — TZs(t), it either holds i £ S or i G S, K' s {t) = 0. In both cases, by construction of 
ACTFB1, the transmission of s does not change any of the K\, K^. indices. Hence, at the beginning of 
slot t + 1, we can select B^\t + 1) = B^\t), for all ICJV, and B Di (t + 1) = B Di (t) so that, for all 
i € M — lZs{t), the inductive hypothesis holds for t + 1 as well. We now concentrate on i € 1Zs{t) and 
consider the following mutually exclusive cases: 

• if i receives s, ACTFB1 forces s to be added to Qjj. and K l s to be decreased by one, while Kjj. is 
increased by one. Accordingly, we select Bf(t+1) = Bf(t)-{bi} and B Di (t+l) = B Di (t)U{b^}, 

(i) n 

while all other sets Bj- remain unaffected. Lemma [5] now implies that the union of the new sets at 
slot t + 1 form a basis of Fg" 1 '. 

• if i erases s and all users in a maximal set Q C M — S receive s, then K l s is decreased by one and 
K l Su g is increased by one, according to ACTFB1. We now select Bf{t + 1) = Bf{t) - {6J and 
Bgyjgit + 1) = B^ljg(t) U {bi^ } while all other sets remain unchanged. Lemma [5] again implies that 
the new sets form a basis of Fj^*' at t + 1. 

• if i erases s and the only users that receive s belong to a set Q C S, no K % x , Kd 4 indices are affected 
so that no sets need be changed. In this case, the inductive hypothesis holds trivially at t + 1. 

Since the above list contains all possible cases, we conclude that the hypothesis is true at the beginning 
of slot t + 1 and the proof is complete. 
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Appendix C 
Proof of Theorem Q] 

A. Some auxiliary results 

We first need to establish some additional notation and intermediate results. Denote with Rg = {Zi = 
0, Vz € Q} the event that all users in set Q receive the transmitted packet, whence it follows from De 
Morgan's law that 

Rg= 1+J (E H nRg^n), (30) 

where c stands for set complement and |+J denotes a union of disjoint sets. For completeness, we define 
E = R = Q (the sample space). Introducing the quantity ps,g = Vx{E$ n Rg) for all disjoint 
S, Q C M, we can use ([30]) to convert the expression Pi(Es) = Pr(£ , 5 n Rg) + Pr(£ , 5 n Rg) into 

Vt(E s ) = Pr(E s n Rg) + ^ Pr(E S uH n Rg-%) ps,g = e S - J] PSu*H,g-H- (31) 

H^0-.Hcg H^0:HCg 

Evaluating the last relation for arbitrary S and Q = {j}, with j S, yields 

Ps,{j} = £5 - e.su{i}- (32) 

The following result provides a general expression for ps,g- 
Lemma 10: For any non-empty disjoint sets S, Q C A/", it holds 

P5,e= (33) 

Proof: Proof is by strong induction on |£7|. Specifically, for arbitrary S and \Q\ = 1 (say, £ = {j}), 
(l33l) becomes 

Ps.tf} = I] (-l) |W| e 5 uw = (-1) O £5U0 + (-l) 1 ^-}, (34) 
HC{j} 

which is identical to (l32l) . We now assume that (l33l) is true for a// 5 and all Q with |C/| = 1, . . . , I and 
show that (l33l) is still true for a// S and a// with \Q\ = I + 1. Specifically, we can write Q = {i} U Q 
where i $ Q and \Q\ = I, so that we only need to show 

P St g=Ps,gu{i}= Yl (- 1 )' W ' e 5u«- (35) 
Hcgu{i} 
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Since any subset H of Q U {i} is either a subset of Q (and therefore does not contain i) or (exclusive 
or) % contains i and a, possibly empty, subset H of Q, the sum in (I35T ) can be written as 



(36) 



(37) 



(38) 



wee ^cg 
= J] Vsvjh - esu{i}uu] • 

However, it also holds 

Ps,gu{ l} = Pr(# 5 n n R g ) = Vr(E s n ifc) - Pr(£ 5 n n Rg) 
= ¥t{E s n JSg) - Pr(E su{i} n = p s ,g - Psu{i},g, 
Since |C/| < I, the inductive hypothesis holds for ps,g, Psu{i}.g> whence we conclude that 

PS,Q = ^2 (-1) |W| £5UW, 
ncg 

Psu{i},g = y^(-l)^^5u{i>u?i- 

Inserting (l38l) in (l37l ) immediately produces the RHS of (l36l ) and the proof is complete. ■ 
An immediate consequence of Lemma [10] is the following result. 

Corollary 1: For any 5 C JV with i € S, the probability that a transmitted packet is received exactly 
by all users in S — {i} (and none other) is given by 

PAf~(S-{i}),S-{i} = ^2 ( _1 )' W ' e (Ar-(5-{i}))UW- 
HCS-{i} 

For the next auxiliary result, we need to introduce some further notation. Consider some given n, 
R > and the application of the original CODEl pu b (i.e. without the fixed blocklength modification) for 
K = \nR~\ packets. We hereafter use consistently a dot accent to explicitly denote a random variable. 
We denote with T* s the number of slots (viewed as a random variable due to the random erasures) it 
takes under CODEl^ for index K l s to become during the processing of queue Qs, while (resp. T*) 
denotes the number of slots it takes under CODEl pit fe to process queue Qs (resp. all queues). Hence, it 
holds 

Tj = max f* s , 

•* (39) 
T* = \ TJ. 

Due to the random erasures, the time- varying index K s (t) is a random process. We denote with ts 
the time when processing of queue Qs begins and define the random variable k % s = K s (ts) so that, by 
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the algorithm's initialization, it holds kl^ = Ki w.p. 1. By construction of CODEl pu b, when queue Qs 
is processed at slot t by transmitting a linear combination s, index K s {t) is reduced by one (assuming 
that Kg{t) > 0) only if s is received by at least one user in set N — {S — {i}) (i.e. received by either i 
or at least one user in TV — S). Denoting with , the number of slots in the time interval between the 

n 

(Z — l)-th and the l-th reduction of index Kg during the processing of Qs, it clearly follows that] 

T*s = J2^h (4°) 

i=i 

where N s 2 are iid geometric random variables with Pr(N s l = v) = (a^) v_1 (l — a s ), where a % s = 
eM-(S-{i})- 

Assuming that packet s is transmitted from Qs at slot t and K s (t) is reduced by 1 at the end of the 
slot, exactly one of the following two mutually exclusive events occurs: either s is successfully received 
by i (w.p. ) or s is not received by i but is received by all users in set T — S (and erased by all 
users in M — T), so that it is placed in queue Q-j-, with T D S, due to step 0] of ACTFB1. The latter 
case occurs with probability P {^J , where 

p l s ^ r = PAf~(T-{i}),T~S- (41) 

Note that the above events occur provided that K % s (t) > is actually decreased by 1, so that the 
corresponding probabilities are actually conditional probabilities. This is the reason for the appearance 
of the term (1 — a l s ) in the denominator of both probabilities. 

We denote with T) % s l D S the index set of the queue to which the transmitted packet s is moved after 
the l-th reduction of index K l s , with 1 < / < k l s , during the processing of Qs- Obviously, this is a 
random variable (hence, the dot) that depends on the exact erasures that occurred during the slot of the 
/-th reduction. From the previous discussion, it holds Vt(V 1 s i = T) = yz^f for all T D S and the total 
number of tokens for user i that were moved into Qj- during the processing of Qs is 

k s ^ T = Y / I[V i S)l = T] (42) 
i=i 

where 

1 w.p. ?£=*f 



W.p. 1 - ^tr 



(43) 

l-a% ' 



3 for consistency, we assume that the 0-th reduction of K z s occurs at Is, i.e. when processing of Qs begins. 
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Step |4] of ACTFB1 now implies the following recursion for all S with |«S| > 2 



E = E il l [vh= s 

j^ics 0^ics 1=1 

i&S i&S 



(44) 



which captures the property that k l s (i.e. the value of K l s at the beginning of processing Q$) is equal to 
the cumulative number of tokens for user i that were moved to Q$ during the prior processing of queues 
Qx, for 1 C S. Rewriting (@4]> as 

we now state the next result. 

Lemma 11: Under the application of CODEl pn b for K = \nR], with R > 0, it holds for all S C AT 
and i € <S 

lim — = kg a.e. 

n—i>oo n 

, ^5 1 

lim -r4— = a.e. 

^ 1 — 



(46) 



where fe^ > are deterministic quantities defined through the recursive relation 

4= E i PAf-(S-{i}),S-X VS:\S\>2, (47) 



and the initial condition k\ .-, = Rj. 



Proof: Proof is by strong induction on \S\. For |<S| = 1, the initialization of the algorithm implies 
that = Ki = \nRi\, whence we conclude that k^/n — > Ri a.e. as n —> oo. Additionally, it holds 
f* {i} = Ya=1 l so that the SLLN yields 



k \i} 



fc W fc {i} 1=1 1 °W 

since k).-, — > oo a.e. as n — > oo. 

We now assume that (l46l ) is true for all 5 with < m. Applying (1431) to any 5 with |<S| = m + 1, 
taking a limit as n — > oo and using the inductive hypothesis for all Z C S (since it holds \T\ < m) and 
the SLLN (since k' x — > oo a.e. as n — ^ oo and the indicator functions are iid random variables), we arrive 
at 

lim ^ = Y I lim ^ ) E [l \V XI = S\] = V 4 = 4- (49) 
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Using (|4~T1 ) to substitute for p % x^ s , a l s , (l49l ) reduces to (l47l) for all 5 with |<S| = m + 1, so that the 
induction is complete for the first equation in (l46l ). To prove the second equation in (l46l) for all S with 
|«S| = m + 1, we follow a procedure similar to the case of |«S| = 1 so that 



^5 1 1 



^5 = E^^^r = 7-E^^E[^] = — - - a.e.asrwoc, (50) 



^5 ^5 i=l 1 a S 



This proves the second equation in (1461 ) and completes the proof. 
Using Lemma [TT] and rewriting (l39l) as 

— — = max = max — — 

n i€S \ n J ies \ k l s n 

n , ' r n 



(51) 



we can take a limit as n — > oo, use (1461 ) and exploit the continuity of max to pass the limit through it 

and arrive at the following Corollary. 

Corollary 2: Under the application of CODEl pu h, it holds 

f% ( 4 

lim — — = max ( - — ^—r I a.e. 

(52) 

a.e. 



lim — = max f 

n->oo n z — ' , ieS V 
m-/-<zr Kf > 



The last auxiliary result is an explicit solution of (|47T ) (along with the initial condition ki^ = i?j) 
which, introducing the variable 



fh = TT7] ~~ y (53) 

is cast into the more convenient form 

fs = -, fiPM-(S-{i}),s-x V5:|5|>2, (54) 

iei 

with an initial condition of ft*. = 1 _ 1 £ ^ . The following Lemma provides an explicit representation of 
and shows that f l s is identical to the quantity fg introduced in Theorem [T] 
Lemma 12: For any set S C A/" with i G <S, it holds 

/s = F ■ ( 55 ) 
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Proof: The following equivalent expression can be derived from Lemma [TOj 

PS,G = E (-^^uh = E (i-C 1 - e ^)) 

(56) 

= E + E (-i)'* l+1 u - ^ uw ) = E (-d |h|+1 (i - zsun), 

ncg ncg ncg 

where we used the binomial theorem to compute Yl,ncg(~ 1)'^' = Sr=o ('r'K - = 0- ~ ^)'^' = 
We initially manipulate (l54b by substituting for pj^_rs-{i}),S-x through (1561 ). which yields 

fs = - r - 1 E fi E (-l) |H|+1 (l-^ (5 _ W) )uw)- (57) 

Extracting the % = term from the summation over % yields 



i l ~ e M-(S-{i})) + E - e(AT-(5-{i}))uw) 

0^HQS-X 



fs — i ; E 
= - E /i+r— 7 — - E E /K-i) |w|+1 d 

{i}CXdS M (S W) 0^HCS-{i} {i}C!CS-H 



-(M-(S-{i}))uHj 



(58) 

where we changed the order of summation in the second sum of the last line. Moving the first sum in 
the RHS of the last expression to the LHS produces 



E fi = 1 _ e * - E (-i) |w|+1 (i-^( 5 -«))u«) 

{i}CICS M-[£>-it}) ^ n cS-{i} 

which provides a new recursion w.r.t. the term X]{i}cxcs fx- 

For a fixed i, we can use induction on |«S| to show the following relation 



{i}CXCS-H 



(59) 



E /i=i > VS, VieS. 



(60) 

I - fir ip r.-n 

Indeed, for |«S| = 1, which implies S = {i}, (l60b yields ft* = , which is identical to the initial 
condition of (f54b . We now assume that d60l is true for all S with |<S| < / and show that it is also true 
for all S with \S\ = I + 1. Specifically, for any S with |<S| =1 + 1, d59l becomes 

E ^=i— E (- 1 )^ l+1 (i-^-(5- W) )u^) 1 - 1 

W cxc5 1 " e ^-(^-«) ^cs-{i} 1 - e (Ar _ (5 _ {i}))uW 

1 

1 " e^_ (5 _ { i }) ' 

where we used the inductive hypothesis for the terms Yluycxcs-H fh s i nce < |<5 — < Z when 
% ^ 0, and applied the binomial theorem. This completes the induction and proves (f60b - 
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We denote with p% = VM-x,x the probability that a packet is received by exactly the users in X (and 
none other), whence we deduce the following relation 

Px = Pr I |J [E M -i n Rx) \ = ??{Em-s) = e/f-s, (62) 

which is true for any S C A/". Hence, it also holds Sxcs-ft} ^ x = e AA-(S-{i})> so that the following is 
true for all S and i G 5 

PI = e Af-(S-{i})' 

xcs-m 

~ 1 (63) 

/ , fxu{i} = 1 " ' 

along with the initial conditions p = ej\f, /j^ = 1/(1 — ejv). The second equation in (l63l) is essentially 
a rewrite of (l60l ). 

We now make the crucial observation that d63l allows for a separate recursive computation of fg, 
PS based on the corresponding initial condition. Since the only difference between the two recursions 
is the RHS term (the recursion for p x , f Xu ^ uses e A r_( (S _{ i }), (1 - e A ^_ (<s _{ i }))~ 1 , respectively), we 
conclude that any relation that holds for px also holds for f xu u\ via a substitution e^/-(5-{i}) ~~ ^ 
(1 — ej v "-(5-{i})) _1 . Combining the last statement with Corollary Q] (which provides an expression for 
PS-{i}\ yields 

(— l) |rt 

/s = f(s-{i})u{i} = i — ; > (64) 



-(5-{i} 

which completes the proof. 



S. Proof of Theorem \T] 

We are now in position to finally prove Theorem Q] Through a change of variable %' = {S — {i}) — %, 
(l55l) can also be written as fg = Ylwcs-U] ' J, — = fs- Additionally, using Lemma [T2l and (l53l) 



to substitute for kg in (1521 ) yields 

f,* /D ^ a ,. Tg I fs(l - e jv -_( S _{j})) \ 

lq(K) = iim — — = max ' L - L ^- = max tqRj), 

sy ' n^oo n ieS \ l-a s ) ies yJS h 

T *(R) 4 lim _ = £ Tj(fl) = £ max (/^), 

where we also used (|4~TT) to substitute for a^. We now show that the achievable region of CODEl pil &, in 
information symbols per transmission, is given by 

^coDEi pub = {R-T*(R) < 1}. (66) 
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The reader can verify that (l66l ) readily yields (fTTb through (I65T ). considering the fact that each sym- 
bol/packet contains L bits. Hence, it remains to prove (l66l ). which is equivalent to proving the following 
statements: 1) any R such that T*(R) < 1 is achievable by CODEl pufe , and 2) no R with f*{R) > 1 is 
achievable by CODEl pu &. 

To prove the first part of (l66l ). consider any i? 7^ with T*{R) < 1 and apply the fixed blocklength 
version of CODEl pu f, (i.e. stop after n transmissions), with K = \nR~\. By construction of the modified 
CODElp^, an error occurs iff T* > n. Hence, the probability of error for the modified CODEl pu b is 

p n (e) = Pr(T* > n) = Pr > lj = Pr - f*(R) > 1 - f^R)^ . (67) 

Letting n — > 00, the relation T*(R) < 1 implies, through (I67T ). that p n (e) — > 0, since the LHS of the 
inequality in the last event in ([67]) goes to as n — > 00, while the RHS is strictly positive. This proves 
the first part of d66l ). A similar argument can be used to show that T*(r) > 1 implies lim ?woo p n (e) = 1, 
which proves the second part of (l66l ). 

Appendix D 
Proof of Lemma[7]and Theorem[2] 

Consider an arbitrary R 6 TZ^d and define the set 

= {A; G A/" : 7r(fc) > j}, (68) 

where 7r is the permutation corresponding to R via (fT2l . Additionally, there exists the functional inverse 
7T _1 of 7r (since fr is a bijection on AO, which is a permutation on M as well. In fact, the introduction 
of 7r _1 allows us to rewrite (l68l) as 

<Mi) = {^ 1 (j),vf- 1 (j + l),...,7f- 1 (A)}, (69) 

which can be proved by standard bidirectional set inclusion. It now holds 

N N 



E max (M) = £ E fs R l = E ^-0) E 5" > ( 7 °) 

•SCAT Z=l 5:i=argmax ies (/ifl i ) J =1 <S:S- 1 (i)=argmax tes (/^ J R 1 ) 

where the last equality follows from the substitution I = 7f _1 (j). Since R G IZord, (fl2l) now implies 

j<S : argmax (/l^) = tT 1 (.?')} = : tT 1 ^ = argmin(^(i))| , (71) 



so that the inner sum in the RHS of (P70b becomes 

e /r o) = e r^ ,.^ 1 — , (72) 

5:*- 1 (i)=argmin ies (S-(i)) 5:{S- 1 (i)}C.SC<I. fr (j) A/-(* ff(i3 -{tt Hi)}) 
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where the first equality follows from the fact that, by construction, all sets S appearing in the summation 
of d72l) satisfy the relation 

Q S C {k e M : vr(fc) > ttCtt-^-))} = <Mi), (73) 



and the second equality follows from d60|) . 
The definition of ^s(j) now implies 

M-($n(j)-{K-\j)}) = {keN:m <i} = {7r- 1 (l),...,7r- 1 (i)}, (74) 
which can again be proved by bidirectional set inclusion. Inserting (1741 into d72l ) and d70l ) finally yields 

AT o 

J] max (r s R t ) = £ z , (75) 

which completes the proof of Lemma [7J 

Regarding Theorem |U we can prove that 7£ C oDEi pub ^^-ord = C out r\lZord by showing that 7£ C oDEi pilb n 
Ttord !5 C out nTZord (the inclusion in the other direction follows trivially from the fact 7£ C oDEi pub C out ). 
Indeed, pick any R € C out n ft^. Since G C out , Lemma H implies that it holds 

/ N 



max 

tt£V 



Ei — — ^ ) ' (76) 

^ 1 " e{^(l),...,7r(i)} / 



where V is the set of all possible permutations on M , so that V includes both n and 7r _1 . Hence, d76l ) 
also holds for the specific permutation 7r (corresponding to the chosen R), which implies through (l75l) 
and Theorem Q] that R € 7£ C oDEi p „ b - Since R also belongs to 7£ or d, it follows that 7£. C odei b n 72. or d 5 
C ou ' n 7£ rd- This completes the proof of the first statement in Theorem |2] 

The second statement of Theorem [2] now follows from the fact that the assumption lZ or d 2 C out (which 
also implies TZ or d 2 7£ C oDEi pub ) transforms the established relation 7£ C oDEi pub ^Tlord = C out HlZord = V 
into 7£coDEi pub = C out = V. Hence, CODEl pM ;, achieves capacity in this case. 

Appendix E 
Proof of Theorem [3] 

For symmetric channels, we introduce the notation em = e% for all I C J\f with a given It then 
holds e*i > . . . > ejy, which in turn implies yi- > . . . > x _ g . A simple index exchange argument in 
Lemma [4] reveals that C out can be written as 

C™* = \r > : V < l\ , (77) 

where 7r is the permutation on M that rearranges R in non-decreasing order, i.e. R%- im\ > . . . > R%-irm. 
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By definition of symmetric channels, it also holds 



fs 



1 — eKT-v ^ V Tn J 1 — e\r_ 

HCS-{i} yV H m=0 v 



■in 



where we used the fact that there exist x ) subsets % of <S — {?} with cardinality m. Hence, f l s is 
independent of i, so that for all R > it holds 

argmax (f^Ri) = argmax = argmin (vr(i)), (78) 

where the last equality is due to the definition of tt. Hence, it holds 1Z or d = {R ■ R> 0} since we can 
select, for each R > 0, the permutation tt = tt to satisfy (fT2l ). Since lZ or d 5 C oui , C0DEl pu b achieves 
capacity for symmetric channels and its rate region is given by (f77T ). 

In the case of one-sided fair spatially independent channels, we must show that any vector R G IZfair, 
i.e. any vector which satisfies 

ei > . . . > e N , 

(79) 

ei-Ri > • • • > e N R N , 

also belongs to TZ or d, i.e. there exists a permutation tt such that it holds arg max ieS (f^R i ) = arg min iGi5 (-7f(i 
for all S C J\f. In fact, we will show that the required permutation tt is the identity permutation; in other 
words, we will prove that (1791 implies f l s Ri > fgRj for all i,j € S with i < j. 

Consider an arbitrary set S C A/" and let i,j G 5. Using Lemma [12] and exploiting the spatial 
independence, we compute f l s as 

fi= y (-i) 1 * 1 y (zg | y _ (-^ 



m j^n (80) 

(_l)l«l (_i)N+i 



For an arbitrary set 5, define 771,5 = mm G 5}, so that it suffices to show f^ s R ms > f l s Ri for all 

S and i £ S. Since it holds, by d79l , > we will prove the desired inequality > by 

? f , 

proving the stronger inequality > -44%-, or equivalently 

e ™s Is 

tifg* >e ms fh, VS,ieS. (81) 
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We now concentrate on (1811) and manipulate it through (1801) to produce the equivalent relation 



v (-1)1*1 — erns €i — > v (-r 



\H\ 



€4 



HCS-{i,m s } " e M . s e ms e t e H HQS ^ ms} \1 - 1 - e^ s e ms e H 

y rum — e j2±^H — > y ( _i 



(82) 

Using the fact that e ms > gj and the following equality 

2 2 

1 ~~ Ti \T-> \i (°->) 



E ("^'m Tn 5 ^ a (85) 



(1 - e7v_5eie//)(l - e N _ s ^m^H (1 - gv-.seje«)(l - ^N-S^m s e n) ' 

we can write an equivalent expression to ( f82l as 

E 1 + E (-l) |W| 7i ^fn^s ? (84) 

where we also used the identity 'Ylucs—{i ms) 

(-1)1*1 =0. 

We now observe that the first term of ( f84l ) is equal to the non-negative quantity fgr ms \ so that, in 
order to prove (l84l ). it suffices to prove the second term in (f84l ) to be non-negative, namely 

e M-S t 'H t ' ltm s _ 

Eq. (I85T ) is now a special case of the following general result 
Lemma 13: For any < ot\, ct2 < 1, it holds 

E("i)'\ n %r fr T^°- (86) 

Proof: Using the geometric series X]£o z ' = V(l — 2), f° r all < z < 1, and setting z = a\ Yli^n e « 
and z = a 2 FLeW e *' vie l ds 

oo oo 

=EE«^n^ +2 - 

1=0 k=0 ieH 

Multiplying ([87]> with (-1) 1 * 1 , summing over all % C <S and using the identity flies ~~ x «) = 
S-hcs( — 1) riig-H Xi (which is easily proved by induction on |«S|) now produces 

oo oo oo oo 

lhs of m = E E E II e * +fe+2 = E E IK 1 - e ' +fc+2 ) * °> < 88 ) 

i=0 fc=0 HQS ieH 1=0 k=0 ieS 

which is the desired result. ■ 
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Appendix F 
Correctness of code2 p „;, 

The following result is a close analogue to Lemma [5] 

Lemma 14: Consider a slot t in subphase 2.1 of C0DE2 pu b, when queues Qu*,j} G Q$ u an ^ Q{i,2,3} 

are combined, and a packet s = YlpeQ . } uq {1 2 3 > a s{P)P * s transmitted. Assume that at the beginning 

of the slot (i.e. before any packet transmission), there exist sets B%\t) C {b^ : p G Qx}, for all 1 C A/" 

and / G J, and B A (t) = {b? : p G Q Di } such that B Dl {t) U (J x^cjV is a basis of F^' 1 for all 

x-«.(t)>o 

/ G A". Define 7£ {i „ J} (i) = |Z G {i*, j} : K l {i%j} (t) > V #{ 1>2j3} (i) > Oj and, for each I G K {i%j} (t), 
pick a vector 6/ as follows 

arbitrary b{,° G fig 2 3} (t) if Su(l) = 0, 
arbitrary G B^, otherwise. 

Then there exist coefficients a s (p) such that the set {b ( s } U B Di m U |J xacN 

is a basis of FL 

A"i(t)>0 

for all I G 7£ { ;, )j} (i). 

Proof: The proof is essentially a repetition of the proof of Lemma [5J the main ingredients being the 
application of Lemma [9] to show that 

/ \ 

{6^} U B Dl (t) U (J B%\t) - {&/} is basis of 



Pr 



X-.XCM 

\ K l T (t)>0 / 



>i-i, 
q 



for all I G Tlu* j\(t), and a standard probabilistic argument where a s (p) are selected iid uniformly in 
F ? . ■ 
Lemma [14] can now be used to show that Lemma [6] is also true for CODE2 jm fe. This is again proved 
by induction on each slot t. In fact, since CODE2 pu 6 is identical to CODEl^ up to (the time where 
each level 2 queue has at most one surviving index), it follows that the inductive hypothesis is true for 
all slots t < t?;, so we only need to apply induction for t > t%. Due to the queue mixing in subphase 2.1, 
the proof of Lemma [6] must be modified as follows. 

Proof of Lemma \6\for CODE2 pu f,: Assume that the inductive hypothesis holds at the beginning of 
slot t > t\ and we are currently combining Qu*j\ G Qg u with Qu^ty- We pick the coefficients for 
the packet s to be transmitted at slot t according to Lemma [14] and distinguish the following mutually 
exclusive cases for each I G TZn,j\(t) (for / Tin* t j\(t), the hypothesis holds for t+1 without changing 
any i.e. we simply select B^(t + 1) = B%\t)) 
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• if I receives s and it holds Su(l) = 0, ACTFB2 requires that KL 2 3 | is decreased by 1 and Kd 1 is 
increased by 1. We set 3} (t + 1) = B$ 2 3} (t) - {6/} and B A (t + 1) = £ D; (i) U {#} while 
all other sets remain unchanged. Lemma [14] implies that the new sets form a basis of F^ 1 ' at slot 
t + 1. 

• if I receives s and it holds Su(l) > 0, then, according to ACTFB2, Kn, -i is decreased by one and 
Kr> t increased by 1. The hypothesis still holds for user I and slot t + 1 by setting + 1) = 
B Dl (t) U {6^} and j} (t + 1) = fiS o (i) - {fy}, while all other sets remain unchanged. 

• if I erases s and k G {1, 2, 3} — receives it, CODE2p U f, requires K]^ .-. to be decreased by 1 
and K 1 ^ 2 3 | increased by one. The inductive hypothesis at t+1 is still true by setting B®. ^ (t+1) = 
Bfl J} (t) - {b { P} and Bf} 2t3} (t + 1) = B^ } (jt) U 

• in all other cases, no K l x , Ke> 1 indices change, so that sets B%\ Bd, remain the same as in slot t, 
and the hypothesis is trivially true at slot t + 1. 

Since the above list contains all possible cases, the inductive hypothesis always holds for alH G Af in 
slot t + 1 and the proof is complete. ■ 

Appendix G 
Proof of Theorem [5] 

Consider a vector R and assume without loss of generality that R > 0. As in the analysis of CODEl^;,, 
we consider a modified version with a fixed blocklength n where the transmitter creates sets of packets 
Ki with |/Cj| = Ki(R) = \nRi\, for i € {1,2,3}, and transmits n symbols. An error is declared if 
CODE2 pu ;, has not terminated by the n-th transmission. The proof is similar to that of Theorem [TJ in the 
sense that the total number of slots T* required by CODE 2^6 is computed as a random variable and it is 
seen that T* jn tends to a deterministic quantity T*(R) w.p. 1 as n — > oo, so that the achievable region 
of C0DE2 pu ;, is {R : T*(R) < 1}. Having found an exact expression for T*(R), simple algebra reveals 
the latter region to be identical to the outer bound of Lemma 01 

We denote J\f = {1, 2, 3} while is the (random) number of time slots it takes CODE2 pu b to process 
queue Q$, so that T* = X^^scaT ^s- Since CODE2 pu b is identical to CODEl pu ;, until the end of phase 
2 (i.e. when each level 2 queue has at most one non-zero K index), we conclude that all level 1 queues 
are processed identically to CODEl pu b, so that Corollary |2] implies, through the appropriate substitutions 
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We now make the following crucial observation regarding the decision taken by CODE2 pu f, at the end 
of phase 2 (denoted as t%). Depending on the exact values of Su(i), the following cases exist: 

• if Su(i) = for all i G N, or Su(i) = 1 for all i 6 N, CODE2 pM fc continues mimicking CODEl pu b 
until the end of the algorithm. In this case, the asymptotic behavior of TJ is obviously still governed 
by Corollary |2l 

• otherwise, CODE2 pu f ) deviates from CODEl^ by further processing each level 2 queue Qs in 
subphase 2.1 mentioned in Section[VTJ An inspection of the ACTFB2 procedure indicates that, during 
the combining of a level 2 queue Qs with Q/j, the actions regarding indices K l s are identical to 
ACTFB1 (in fact, the only difference between ACTFB1 and ACTFB2 lies in the handling of indices 
Kj*/). Since each level 2 queue is still processed until all its K indices become zero, we conclude 
that, if we denote with T s the total number of slots required for the processing of Qs during phase 
2 and subphase 2.1, Corollary [2] still holds. However, the value of Kfa at the beginning of phase 3 
will be different than the corresponding value under CODEl pu b due to the interjection of subphase 
2.1. 

Denote with £3 the beginning of phase 3, equivalently the end of phase 2 or subphase 2.1 (if the 
latter occurred). Since CODE2 pit 6 again mimics CODEl pu b during phase 3, Corollary [2] implies, under 
the obvious substitutions, that 

1 



lim — ^ = max 

n->oo n i&M 



lim 



a.e., (90) 



n—>oc y n I 1 — t 
provided that the rightmost limit exists w.p. 1 (this will be shown later). It then follows that 

lim — = V max( fLRA + lim a.e., (91) 

S:\S\<2 

so that we hereafter concentrate on the computation of the last limit, which clearly depends on the specific 
decision at t\. 

Denote with T* s the number of slots it takes CODEl pM 6 (or CODE2 pn b, if we consider both phase 2 
and subphase 2.1) to process a level 2 queue Qs until K l s becomes 0. It clearly holds TJ = maxjgs T* s ; 
if we also define T s = mmi e sT* s , we can combine Lemma [TT] and Corollary |2] to deduce 

T f 

= lim = min (f s Ri) a.e., (92) 
n^oo n ies 



in addition to 



f* s = lim ^ = m 

fi = lim = max (f s Ri) a.e., 
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which we already used in d9T| ). 

We next find an expression for K^it^), for all S with |«S| = 2, since this will affect the branching 
decision made by CODE2 pu {, at t^. The following relation is true for all S with |<S| = 2 and describes 

the total decrease of each K index of a level 2 queue in the interval [t* 

ft 

K l s (t* 2 ) = K l s (tl) -^I[K l s reduced by 1 during Z-th slot of processing Qs in phase 2]. (94) 

i=i 

Dividing by n and using (|92l we conclude that 

(t* ) I (t*)\ I \ 

k\ s = lim sK 2) = lim 51 l} - lim -£ Pi(Kx reduced by 1 during proc. Q s ) 

n->oo n ^n-*oo n y ^n->oo n y ^ 

= fs R ii l - e JV-(s-{i})) ~ min (/s^X 1 - ejV-(5-{i})) a - e - v5 : l 5 l = 2 > 

t Go 

where we used Lemma ITTI (which is still applicable at t*) for the asymptotic behavior of K s {t\)/n. The 
subscript 2 emphasizes that the quantity refers to a limit of a random variable at i|. 
For 5 = (1951 ) can be written as 



K). . x (t%) 

n->oo n ^>\ l JS 



which motivates us to define 

where [x] + = max(x,0) and we explicitly state the R dependence of ^Lfj.- The binary relation i y j 
is introduced to denote the inequality f{ijyRi > fhjyRj (equivalently, ^{jjj > 0) which, using the 
definition of f l s , can be expanded to 

fu 4 \Ri > fl x Ri & Ri ( — ) > Ri ( — ) ■ (98) 

We also write « >r j iff f\jj}Ri — anc ^ * X J if f\ij}Ri = f{ij}^j (note that all relations 

^, x implicitly depend on R), whence the following result follows. 

Lemma 15: Consider any R > and distinct i,j, k G A/". If i y j and j y k, then i >- k. Similarly, 
if i y j and j y k, then i y k. 

Proof: We prove by contradiction only the first part since the second one follows similarly. We 
assume that k y i, so that it holds 

I\,yM, > f j H n R 3 & Rt (- —!—) > Rj (- —!—) , 

1 1 \ _ / 1 1 



//, fc> i?i > /£■ fe} #fc & Rj --. >R k [ z z , (99) 

/ { \ fc}J R fc > r {i>k} Ri o f ) > ^ f ) . 
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The terms in parentheses above are non-negative by construction. In fact, the term , — ^-J — is 

positive, since otherwise we would conclude that is strictly larger than a non-negative number. We can 
then use a similar reasoning and the fact that R > to show that all terms in parentheses are positive. 
Hence, we can multiply the 3 equations by sides and arrive at a contradiction that a number is strictly 
larger than itself. ■ 
Using the notation of (l43l) . we can find the value of K 1 ^ at i| as 



\nRi] T| 

km) = E = w + E X>fe = ( 10 °) 



= 1 5:ig5 J=l 

151=2 



where the first, second term is the number of tokens moved during phase 1, 2, respectively. Using a 
procedure similar to Lemma ITTl we can find 



k\ Kf = lim 



^1) = Ri Pr{ n h =ao + E 4) p ^5 = AO 



2JV 

77, L J ' — ' \ n->oo 77, 

5:i£5 

|5|=2 (1Q1) 



f{i}RiP{i},N-{i} + E ™™(fs R l)PirN--S- 



5:iG5 
|5|=2 

Any variation of between t| ( en d of phase 2) and i 3 (beginning of phase 3) under C0DE2 pu b can 
only be due to subphase 2.1 or the continuation of processing level 2 queues if Su(l) = 1 for all I G M. 
Hence we conclude: 

#V(*2) if Su(l) = V/gA/", 



kif{tl) + Es-.ies E2i Ti l[D% i = A/"] if 5«(i) = 1 Vi G A/, 

|5|=2 



+ E5: i6 5 ^ ,] - E5: i6 5 E£l ^ 

|5|=2 |5|=2 



otherwise, 

(102) 

where IfDj J = I[Kj^ increased during l-th slot of processing Q$ in subphase 2.1] with a similar defi- 
nition for I[-DjJ (replacing increased with decreased). 

At this point, it is convenient to consider the following two complementary cases and individually 
examine each of them. 



• it holds r l s = for all S with |<S| = 2 and I E S. Equivalently, it holds i x j x k. 

• it holds r l s > for at least one I G S with |<S| = 2. 
7) The case ixjx fe: Equations (|92l , d93l imply that 



lim — = lim — — — = a.e., (103) 

n— >oo 77 n— >oo 77, 
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so that, examining all 3 cases in (11021 ). we conclude that 

lim sm _ lim «m (104 ) 

n— >oo 77 n—>oc 77 

which implies, through d90l . (1911 ). that CODEl pu b and CODE2 pu b have the same asymptotic performance 
(meaning that T*(R) = Imin^^T* / n is the same function under both algorithms) for all R such that 
i >c j >; k. Hence, defining the set TZ = {R > : i x j x k}, we conclude 7£ C oDEi pub H 7£ = 
^•coDE2 pui , H TZ. Furthermore, it holds TZ C TZ or d, where 1Z or d was defined in < fT2b . so that 

C out n ^ = C out n ^ n ^ = ^ C0DEl!)ub n rc ord n K c ^ C0 DE2 pu6 n ft, (105) 

where the last set equality is due to Theorem |2 Hence, CODE2 pn !, achieves all rates in C out n 7\L 

2) The case r l s > for at least one I E S with \S\ = 2: Let S = {i,j} and assume r l s > 0, so 
that i >~ j. Then, two mutually exclusive cases exist according to Lemma [T51 (in the following, are 
distinct): 

• it holds k y i, so that k >~ j. 

• it holds i >~ k. 

In the first case, it follows from d96l ) that it holds w.p. 1 

lim 2J > 0, lim -Mill = o, (106) 

n— ¥oo n n— >oo 77 

lim — > 0, lim — ^ = 0, (107) 

n— ><x 77 n— >oo 77 

so that, but the definition of limit, there exists some m such that for all n > n\ it holds Su(i) > 
1, Su(k) > 1, Su(j) = 0. In the second case, (1106b is still true and it also holds 

lim > o, lim {t ' fc}l ^ = 0, (108) 

n— >oo 71 ra— 5>oo 77 

which implies via a similar argument that there exists some 712 such that Su(i) = 2, Su(j) < 1, Su(k) < 
1, for all n > ri2. 

Hence, in both cases there exists a sufficiently large no such that for all n > no, the first two branches 
in (11021 ) are excluded. Hence, it holds 



lim 

71— i>oo 77 



lim ^MMl + £ (7*^ - mm (/J^)) P W ^-S 



5:ig5 
|.S|=2 

(109) 



£ f max (/p?,) - (1 - Ci 



|5|=2 



a.e., 
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which can also be written as 



n 



Km 36® + Y, l[r% > 0] P{i} ,M- S - £ I[4 = 0](1 - a) 

n— ¥00 fi z — J z — J 



s-.ies S:ieS 

\S\=2 \S\=2 



(110) 



It is now a matter of case distinction, depending on the values of r % s , and simple algebra to verify that 
CODE2p u6 also achieves all rates in C out n TZ C , so that it achieves C out . 
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